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Abstract — Radio Frequency IDentification (RFID) security and 
privacy are exciting research areas that involve affluent 
interactions among many disciplines like signal processing, 
supply-chain logistics, hardware design, privacy rights and 
cryptography. There remain connections to be explored between 
the work surveyed here and other areas of study. This paper 
explores by highlighting a few of these. The majority of the 
articles treated in this survey explore security and privacy as an 
issue between RFID tags and readers and also compare with 
other technologies such as Barcode. Of course, tags and readers 
lie at the periphery of a full-scale RFID system. Many of the 
attendant data-security problems like that of authenticating 
readers to servers involve already familiar data-security 
protocols. This paper also mentions key management, costing, tag 
collision for RFID and identifies PIN distribution for tags as one 
such potential problem. 

Keywords-RFID; Privacy and security; RFID tags; RFID 
readers 



I. 



Introduction 



RFID technology uses radio-frequency waves to 
automatically identify people or objects. There are several 
methods of identification, but the most common is to store a 
serial number that identifies a person or object, and perhaps 
other information, on a microchip that is attached to an 
antenna (the chip and the antenna together are called an RFID 
transponder or an RFID tag). The antenna enables the chip to 
transmit the identification information to a reader. The reader 
converts the radio waves reflected back from the RFID tag 
into digital information that can then be passed on to 
computers that can make use of it. RFID is automatic and fast 
and will replace the barcode system in the near future. The big 
difference between RFID and barcodes is line-of-sight 
technology. That is, a scanner has to see the barcode to read it, 
which means people usually have to adjust the barcode toward 
a scanner for it to be read, RFID by contrast, does not require 
line of sight. RFID tags can be read as long as they are within 
range of a reader. RFID is a proven technology that has been 
around since at least the 1970s. Up until now, it has been too 
expensive and too limited to be realistic for commercial 
applications. But if the cost associated with making tags is 
reduced enough, they can solve many of the problems 
associated with barcodes and bring much more benefit. Both 
the size and cost of RFID tags have been continuously 
decreasing. With potentially significant applications and the 
cheap price of RFID technology, it is predictable that every 
moving object could be tagged in the near future. In recent 



years, since Wal-Mart originated and applied RFID 
technology in supply chain management, RFID has been 
widely used in many different fields such as defence and 
military, postal package tracking, aviation industry, health care 
and baggage and passenger tracing in airport etc. The use of an 
RFID system is appropriate basically everywhere that 
something has to be automatically labelled, identified, 
registered, stored, monitored or transported. RFID systems are 
available in a wide scope. Despite the wide range of RFID 
solutions, each RFID system consists of two components: 
a) a transponder and b) a reader [19]. 

II. Motivation, scope and Limitations 

RFID systems offer improved efficiency in inventory 
control, Library Management, Automation systems, logistics 
and supply chain management. As such, they are of great 
interest to enterprisers intensively reliant on supply chains, 
particularly large retailers and consumer product 
manufacturers. We first want to know its various applications 
in different areas. But without proper protection, wide spread 
embracing of retail RFID could raise privacy concerns for 
everyday consumers. The standard of RFID security system is 
not good enough to protect their system from outside attack. 
Thus the security issues of RFID are an intriguing research 
topic. This paper proposes which type of RFID security 
system is better and highlights the trivial RFID 
communications. 

An organization using the RFID technology has a long term 
goal to integrate RFID on the retail level for better return on 
investment (ROI). RFID modernizes the whole software 
configuration management (SCM) process. And in business 
the company or organization that has modern and up-to-date 
supply chain management is expected to be on top of others. 
On the other hand every educational Institute might build up 
their library management automation system using RFID Tag. 
Section III of this paper describe about the basics of RFID 
system. Here I give a brief idea about RFID components like 
RFID tags, Readers, Antenna etc., the working technology of 
RFID system and some possible types of attack in the RFID 
system. Section IV discusses about the uses and security and 
privacy issues related to RFID system. In section V, I give 
some ideas related to using RFID in the coming days and 
finally conclude the paper in section VI. 

III. Background Study 
RFID system is defined by the following three features: 
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■ Electronic identification: The system makes possible 
an unmistakable labelling of objects by means of 
electronically stored data. 

■ Contact less Data transmission: Data identifying the 
object can be read wirelessly through a radio 
frequency channel. 

■ Transmit when requested (on call): A labelled object 
only transmits data when a matching reader initiates 
this process. In technical terms, an RFID system 
consists of two components: a transponder and a 
reader. 

A. Transponders and Readers 

The transponder also known as a tag acts as the actual data 
carrier. It is applied to an object (for instance, on a good or 
package) or integrated into an object (for instance, in a smart 
card) and can be read without making contact, and rewritten 
depending on the technology used. Fundamentally the 
transponder consists of an integrated circuit and a radio- 
frequency module. An identification number is stored along 
with other data on the transponder and the object with which it 
is connected. The reading unit typically called the reader as in 
the following consists of a reading, in some cases a write/read 
unit and an antenna. The reader reads data from the 
transponder and in some cases instructs the transponder to 
store further data. The reader also monitors the quality of data 
transmission. RFID systems must offer at least the following 
features: 

■ Identify the transponder within a specified range. 

■ Read the data of the transponder. 

■ Select the transponders relevant for the particular 
system. 

■ Guarantee that more than one transponder can be 
managed within the range of the reader. 

■ Have some way to recognize errors in order to 
guarantee operation security. 

B. Modes of transmission 

Two basically different types of procedure are used to 
transmit data between the transponder and a reader: duplex 
procedures including both full duplex (FDX) and half duplex 
(HDX) and sequential systems (SEQ). The full and half duplex 
procedures have in common that the energy transmission 
between reader and transponder is uninterrupted, both in the 
uplink and in the down link, independently of the data 
transmission. With sequential systems on the other hand the 
transponder is supplied with energy only in the pauses in data 
transmission between the tag and the reader. 

RFID systems can be subdivided into three categories by 
their ranges: close-coupling, remote-coupling and long-range 
systems. Close-coupling systems have a range up to one 
centimetre. Close-coupling systems can work with almost any 
frequencies (from low frequency to 30 MHz), depending on 
the coupling used. Remote-coupling systems have a range of 
up to about one meter. They typically work in the frequency 
range below 135 KHz and at 13.56 MHz. The coupling 
between the reader and transponder is done inductively. In 
exceptional cases higher ranges are also possible: 100 meters 
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spectrum around 5.8 GHz, which is currently in a very early 
developmental stage. 



C. Types of attack 

A person who attacks an RFID system may hunt various 
goals, which can be classified as follows: 

■ Spying: The attacker gains unauthorized access to 
information. 

■ Deception: The attacker deceives the operator or user 
of an RFID system by feeding in incorrect 
information. 

■ Denial of Service (DoS): The availability of functions 
of the RFID system is compromised. 

■ Protection of privacy: Because the attacker believes 
that his privacy is threatened by the RFID system, he 
protects himself by attacking the system. 

And some common security measures are security 
precautions, authentication, checking the identity of the tag, 
scrutinizing the identity of the reader, strong mutual 
authentication, encryption, anti-collision protocols that are 
safe from eavesdropping, silent tree- walking etc. 

IV. RFID USES AND SECURITY ISSUES 

I provide some views on security issues concerning RFID 
systems and highlight some of the areas that have to be 
considered regarding this topic. To deal with security and 
RFID means to deal not only with security aspects of RFID 
systems but also with security aspects of anything or anyone 
affected by RFID systems. The widespread diffusion of 
identification technology and storage devices certainly has 
side effects and can lead to new threats in other areas and 
applications [17]. 

A. RFID uses 

Access control and personnel tracking and location systems 
can help to assure the security of restricted areas suppose in 
airports (such as flight lines, baggage handling areas, customs, 
employee lounges), passports, for children's security at 
Schools, Parks, Hospitals and other sensitive areas [9]. 
1. Passports: RFID tags are used in passports issued by many 
countries. The first RFID passports (e-passports) were issued 
by Malaysia in 1998. Malaysian e-passports record the travel 
history (time, data and Place) of entries and exits from the 
country. 

Now some security concerns on the e-passports. When 
New Zealand launched e-passport then a source from U.K. 
mentioned that RFID (radio frequency ID) chips in passports 
can be cracked in as little as 48 hours [20]. British newspaper 
The Guardian reports it was able to access the data stored on 
RFID cards in Britain's newly launched smart passports. 
However, the New Zealand Department of Internal Affairs 
(DIA) says there isn't enough information contained within the 
New Zealand passports' chips to create counterfeit travel 
documents. DIA passport manager David Philip confirmed 
that it is possible to access the information stored on the RFID 
chips and use it to make a clone. However, the RFID chip in 
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the e-passports currently issued in New Zealand is just one 
security feature out of more than 50 contained in the passport 
[20]. Having just a cloned chip is not sufficient to create a 
counterfeit passport, Philip says, and adds that such an end 
eager is quite involved. While New Zealand passports are 
"highly desirable," the DIA has seen very few credible 
counterfeited ones, he says. While the general design goal of 
the e-passport is to lock the holder's identity to the document 
in a secure manner, Philip says that there has to be a balance 
between risk management and customer service [20],[1]. The 
passport has to be readable around the world in a practical 
amount of time and preferably in more situations than just 
immigration. Philip gives airport check-ins as one example of 
where RFID-equipped passports should be readable. Making 
the e-passport harder to read is possible, Philp says, but it 
would make immigration processing take longer and 
inconvenience people. Researcher Peter Gutmann at the 
University of Auckland's department of Computer Science is 
sceptical that the RFID chip provides any real security benefit 
[16]. In fact, Gutmann goes further and says in his technical 
background paper, Why biometrics is not a panacea, that 
RFIDs in passports "are a disaster waiting to happen." German 
and Dutch passports have already been compromised, 
according to Gutmann, and this can be done remotely as well 
[1]. He points to successful attacks by Dutch RFID security 
specialist Harko Robroch, who intercepted passport and reader 
device communications from five meters away. Gutmann says 
eavesdropping on the reader was possible up to 25 meters 
[20]. In comparison, the Guardian article says U.K. passports 
are readable 7.5cm away, a far shorter distance than Robroch's 
interception, but enough in situations such as public transport, 
where people are close together, to draw off the data stored in 
the RFID chip. 

However, Gutmann's worst-case scenario for RFIDs in 
passports occurs not when they are being compromised for 
counterfeiting purposes, but are used to identify the holder. 
The RFID chip could be used to trigger explosive charges and 
Gutmann points to a study that shows the current U.S. passport 
design caused a small, non-lethal explosive charge masked in 
a rubbish tin to detonate. The New Zealand Department of 
Internal Affairs (DIA) confirmed reports it is possible to 
access the information stored on the RFID (radio frequency 
ID) chips in Britain's newly launched e-passports and use it to 
make a clone. But said the danger lies not when they are being 
compromised for counterfeiting purposes, but are used to 
identify the holder. 
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payment and communication systems to support the move 
towards all electronic open road tolling and emerging traffic 
management applications, such as high occupancy tolling 
(HOT) lanes, congestion pricing, dynamic road pricing and 
express lanes, to mitigate bottleneck congestion or increase 
infrastructure capacity during peak usage [2] . 

In this economy, the paper-thin eGo Plus tag, priced under 
$10, provides a significant savings for motorists compared to 
similarly-performing hard case tags that have typically sold for 
$25 to $30 while improving performance capabilities. The 
sticker tag is comparable in size to a vehicle inspection sticker 
and mounts easily on a motorist's windshield. The slim form 
factor also increases point of purchase options making it 
adaptable to retail outlets and more easily accessible beyond 
traditional toll customer service centers. Early users of the eGo 
Plus windshield sticker tag technology experienced two to four 
times the expected motorist adoption rate, quickly establishing 
that the paper-thin tag could aid in overcoming deployment 
barriers that previously hindered widespread motorist use. The 
paper-thin, battery less ego technology provides 
environmental value as well. 

■ Increasing wireless payment of tolls reduces 
congestion and eliminates idle times at toll plazas, 
lowering vehicle emissions and improving air quality. 
By eliminating barriers to adoption, as seen with eGo 
Plus tags, more motorists will use this form of 
wireless payment. 

■ The smaller profile tag consumes less petroleum 
based raw material to manufacture and reduces 
transportation and shipping requirements. 

■ The battery less design of the tag eliminates the 
additional cost and demand for batteries and 
subsequent storage and disposal requirements. 



The latest in the line of RFID products is the eZGo 
Anywhere tag. To advance inter operability for electronic toll 
collection systems nationwide, the eZGo Anywhere standard 
onboard unit (OBU) is designed to simplify wireless payment 
of tolls for motorists that travel across states and require 
different tags for each region's toll system, such as a motorist 
from the Northeast who travels south to Florida or west to 
Texas. With these orders, TransCore's eGo family of tags 
exceeds 11.3 million transponders shipped while globally 
TransCore's RFID technology deployed in various 
transportation applications in 41 countries exceeds 35 million 
RFID tags and 55,000 readers. 



2. Transport payments: Throughout Europe and in 
Particular in Paris in France ( system started in 1995 by the 
RATP), Lyon and Marseille in France, porto and Lisbon in 
Portugal, Milan and Torino in Italy, Brussels in Belgium, 
RFIF passes conforming to the Calypso(RFID) international 
standard are used for public transport systems. They are also 
used now in Canada, Mexico, Israel, Bogota and Pereira in 
Colombia, Scavenger in Norway, etc. Today, the shift to an 
almost cashless culture has transportation authority's 
deploying technology that can accelerate the use of wireless 



3. RFID for Library: Among the many uses of RFID 
technologies is its deployment in libraries. This technology 
has slowly began to replace the traditional barcodes on library 
items (books, CDs, DVDs, etc.). Some handy information 
about RFID for Library is as follows: 

a. RFID tags replace both the EM security strips and 
Barcode. 

b. Simplify patron self check-out / check-in. 

c. Ability to handle material without exception for video 
and audio tapes. 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



d. Radio Frequency anti-theft detection is innovative 
and safe. 

e. High-speed inventory and identify items which are 
out of proper order. 

/ Long-term development guarantee when using Open 
Standard. 

RFID is the latest technology to be used in library theft 
detection systems. Unlike EM (Electro-Mechanical) and RF 
(Radio Frequency) systems, which have been used in libraries 
for decades, RFID-based systems move beyond security to 
become tracking systems that combine security with more 
efficient tracking of materials throughout the library, including 
easier and faster charge and discharge, inventorying, and 
materials handling [7]. 

RFID is a mixture of radio-frequency-based technology 
and microchip technology. The information contained on 
microchips in the tags a fixed to library materials is read using 
radio frequency technology regardless of item orientation or 
alignment (i.e., the technology does not require line-of- sight or 
a fixed plane to read tags as do traditional theft detection 
systems) and distance from the item is not a critical factor 
except in the case of extra- wide exit gates [14]. The corridors 
at the building exit(s) can be as wide as four feet because the 
tags can be read at a distance of up to two feet by each of two 
parallel exit sensors [14], [8]. The targets used in RFID 
systems can replace both EM or RF theft detection targets and 
barcodes. 

Advantages of RFID system in library 

a. Rapid check-out / check-in- The use of RFID reduces 
the amount of time required to perform circulation operations. 
The most significant time savings are attributable to the facts 
that information can be read from RFID tags much faster than 
from bar codes and that several items in a stack can be read at 
the same time [11]. While initially unreliable, the anti- 
collision algorithm that allows an entire stack to be check-out 
or check-in now appears to be working well. The other time 
savings realized by circulation staff are modest unless the 
RFID tags replace both the EM security strips or RF tags of 
older theft detection systems and the barcodes of the library 
management system - i.e., the system is a comprehensive 
RFID system that combines RFID security and the tracking of 
materials throughout the library; or it is a hybrid system that 
uses EM for security and RFID for tracking, but handles both 
simultaneously with a single piece of equipment [14]. There 
can be as much as a 50 percent increase in throughput. The 
time savings are less for check-out than for check-in because 
the time required for check-out usually is extended by social 
interaction with patrons. For patrons using self check out, 
there is a marked improvement because they do not have to 
carefully place materials within a designated template and they 
can check out several items at the same time. Patron self 
check-in shifts that work from staff to patrons. Staff is relieved 
further when readers are installed in book drops [2]. 
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b. High reliability- 
i. The readers are highly reliable. RFID library systems 

claim an almost 100 percent detection rate using RFID 
tags. 

ii. There is no false alarm than with older technologies once 
an RFID system is properly tuned. 

iii. RFID systems encode the circulation status on the RFID 
tag. This is done by designating a bit as the "theft" (EAS) 
bit and turning it off at time of check-out and on at time of 
check-in. If the material that has not been properly check- 
out is taken past the exit sensors, an immediate alarm is 
triggered [3]. 

c. High-speed inventorying - 
A unique advantage of RFID systems is their ability to scan 
books on the shelves without tipping them out or removing 
them. A hand-held inventory reader can be moved rapidly 
across a shelf of books to read all of the unique identification 
information. Using wireless technology, it is possible not only 
to update the inventory, but also to identify items which are 
out of proper order [19]. 

d. Automated materials handling- 
Another application of RFID technology is automated 
materials handling. This includes conveyer and sorting 
systems that can move library materials and sort them by 
category into separate bins or onto separate carts. This 
significantly reduces the amount of staff time required to 
ready materials for re- shelving. 

e. Long tag life- 
Finally, RFID tags last longer than barcodes because nothing 
comes into contact with them. Most RFID vendors claim a 
minimum of 100,000 transactions before a tag may need to be 
replaced. 

4. RFID for children's security at Schools, Parks, Swimming 
pools etc.: Traditionally, school facilities have been 
characterized as easily accessible, open to anyone seeking 
access. The historical absence of security threats facilitated 
this culture of openness, which schools have been reluctant to 
abandon even in the face of changing circumstances like 
terrorism. Due to the ongoing global terrorism, organized 
kidnapping of rich people's children and in the face of other 
social critical issues, the traditional school security systems 
have proven to be insufficient. The same is applicable in other 
institution and infrastructure for children's like Park, 
swimming pools etc. Therefore, RFID being the only 
technology capable to tracking and identifying any person on 
the move provides a perfect school and children's security 
model in the current and future security context. School 
authorities in the Japanese city of Osaka are now chipping 
children's clothing, back packs, and student IDs in a primary 
school [23]. 

A school in Doncaster, England is piloting a monitoring 
system designed to keep tabs on pupils by tracking radio chips 
in their uniforms [21]. St Charles Sixth Form College in West 
London, England, started September, 2008, is using an RFID 
card system to check in and out of the main gate, to both track 
attendance and prevent unauthorized entrance. As is Whitcliffe 
Mount School in Cleckheaton, England which uses RFID to 
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track pupils and staff in and out of the building via specially 
designed cards. In the Philippines, some schools already use 
RFID in IDs for borrowing books and also gates in those 
particular schools have RFID ID scanners for buying items at 
a school shop and canteen, library and also to sign in and sign 
out for student and teacher's attendance. These schools are 
Claret School of Quezon City, Colegio de San Juan de Letran, 
San Beda College and other private Schools [21]. 
Benefits of RFID security at schools 

Parent & authority will always be informed about a kid's 
or student's real time location wherever they are inside the 
school, park or swimming pool area. School, Park and 
authority will be notified and or alarmed if any student or kids 
wants to go away from the authorized area or premise. 
Students and kid's wristband tag or RFID card will be always 
visible for visual check by security personnel and can be read 
by RFID system automatically in the entrance and exit of the 
school or parks [11]. Parents or authorized guardians will also 
carry the authorized ID for children's that will authorize them 
to enter the school or park designated area. School authority 
will be able to track and monitor attendance of students at 
School at real time [4]. Parents/guardians can be notified 
instantly and automatically in case of his/her child goes out of 
school before scheduled time. It also helps the teachers to 
record exact time of student' s attendants for yearly review of a 
student's discipline [4]. 

5. Employee tracking and attendance time: All printed photo 
IDs are subject to counterfeiting, alteration, duplication, and 
forgery. Deltech's secured photo ID card gives you highest 
level of security so you know who's who. Deltech's RFID 
technology makes photo ID and access control easy while 
taking security a step further, ensuring that only authorized 
individuals are able to access your office or secured building 
and only authorized person/authority can view the details of a 
person of your organization [4] . Whether you need to identify 
people or control access, Deltech's ID card gives you the peace 
of mind that the right people have access to the right places. 
We can also provide and integrate Biometric finger print or 
face recognition based access control along with RFID [4] . 

Benefits of using Deltech RFID Employee tracking Systems: 

You can track your employee or staff no matter where 
they are in the building you can monitor when your staff or 
employee gets into your office building and when they exit 
and you can monitor the real time attendance/location of your 
employee from anywhere in the world [5]. In case of any 
emergency you will be able to read the employees/staff's 
urgent information on the move using any mobile reader 
(optional). I.e. an employee meets an accident during the work 
or in the area of operation. His/her encoded information in the 
ID card gives you an easy access to his/her all required 
information including blood group, address etc without 
exposing the information to unwanted people [21]. An 
unauthorized person will not be able to access the restricted 
areas. Visitors will not be able to leave the premises without 
returning the visitors card on the security gate/counter. Auto 
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payroll system can be managed based on the employee work 
hour performed as reported by the RFID System. 



6. RFID in Hospitals: 

Deltech RFID system can be used to track patients, doctors, 
nurses and expensive equipment in hospitals in real time. 
RFID tags can be attached to the ID bracelets of all patients, or 
just patients requiring special attention, so their location can 
be tracked constantly [4], [5]. Deltech RFID technology can 
also provide an electronic link for wirelessly communicating 
patient data. An instant assessment of critical equipment and 
personnel locations is also possible through RFID technology. 
Del Technology Limited has implemented RFID solution at 
Apollo Hospitals Dhaka to be able to identify the location of 
Doctors, Nurses and other employees and also to be able to 
register each patients using RFID tags that stores patient's data 
into the RFID chip. RFID Systems of Deltech implemented at 
Apollo Hospitals Dhaka, Bangladesh. These applications can 
be combined with Deltech RFID and or Biometric access 
control to allow only authorized personnel to access to critical 
areas of the hospital [6]. 

Benefits of using Deltech RFID Systems for Hospitals: 
Continuously track each patient's location. 
Track the location of doctors and nurses in the 
hospital. 

Track the location of expensive and critical 
instruments and equipment. 

Restrict access to drugs, paediatrics, and other high- 
threat areas to authorized staff. 

observe and track unofficial persons who are loitering 
around high-threat areas. 

Facilitate triage processes by restricting access to 
authorized staff and "approved" patients during 
medical emergencies, epidemics, terrorist threats, and 
other times when demands could threaten the 
hospital's ability to effectively deliver services [6]. 
■ Use the patient's RFID tag to access patient 
information for review and update through a hand- 
held computer. 

7. Animal Identification: 

The National Animal Identification System (NAIS) is a 
government-run system in United States to identify animals 
and the premises where they have been, in order to provide the 
potential to identify and isolate threatening diseases. The cattle 
system is expected to use individual identification with 
information of the animals' current and previous locations and 
dates of transfer, sent to a central database. The details of a 
national plan are still being developed and debated, and 
changes may occur before finalized. This factsheet is an 
attempt to help producers understand the NAIS as proposed 
and interpreted. RFID tags for animals represent one of the 
oldest uses of RFID technology. Originally meant for large 
ranches and rough terrain, since the outbreak of Mad Cow 
Disease, RFID has become crucial in animal identification 
management. A variety of RFID tags or transponders can also 
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be used for animal identification. The transponders are more 
well-known as passive RFID technology, or simply "Chips" on 
animals [24]. 



8. Human identification: 

The success of various animal identification uses since the 
early 1990s has spurred RFID research into various human 
tracking alternatives. Impart-able RFID chips designed for 
animal tagging are now being used in humans. An early 
experiment with RFID implants was conducted by British 
professor of cybernetics Kevin Warwick, who implanted a 
chip in his arm in 1998. In 2004 Conrod Chase offered 
implanted chips in his night clubs in Barcelona and Rotterdam 
to identify their VIP customers, who in turn use it to pay for 
drinks. In 2004, the Mexican Attorney General's office 
implanted 1 8 of its staff members with the Verichip to control 
access to a secure data room [18]. 

Security experts have warned against using RFID for 
authenticating people due to the risk of identity theft. For 
instance a man-in-the-middle attack would make it possible 
for an attacker to steal the identity of a person in real-time. 
Due to the resource constraints of RFIDs it is virtually 
impossible to protect against such attack models as this would 
require complex distance-binding protocols [18] [15] [13] [22]. 
Privacy advocates have protested against Impart-able RFID 
chips, warning of potential abuse and denouncing these types 
of RFID devices as "spychips," and that use by governments 
could lead to an increased loss of civil liberties and would lend 
itself too easily to abuse. One such case of this abuse would be 
in the microchip's dual use as a tracking device. Such concerns 
were justified in the United States, when the FBI program 
COINTELPRO was revealed to have tracked the activities of 
high profile political activist and dissident figures. 

There is also the possibility that the chip's information 
will be available to those other than governments, such as 
private business, thus giving employers highly delicate 
information about employees. In addition, privacy advocates 
state that the information contained in this chip could easily be 
stolen, so that storing anything private in it would be to risk 
identity theft. According to the FDA, implantation of an RFID 
chip poses potential medical downsides. Electrical hazards, 
MRI incompatibility, adverse tissue reaction, and migration of 
the implanted transponder are just a few of the potential risks 
associated with the Verichip ID implant device, according to 
an October 12, 2004 letter issued by the Food and Drug 
Administration (FDA). 

9. RFID in inventory system: 

An advanced automatic identification technology such as the 
Auto-ID based on the RFID technology as two values for 
inventory systems is already in use. First, the visibility 
provided by this technology allows an accurate knowledge on 
the inventory level by eliminating the discrepancy between 
inventory record and physical inventory. In an academic study 
performed at Wal-Mart, RFID reduced out of Stocks by 30 
percent for products selling between 0.1 and 15 units a day. 
Second, the RFID technology can prevent or reduce the 
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sources of errors. Benefits of using RFID include the reduction 
of labour costs, the simplification of business processes and 
the reduction of inventory inaccuracies. Wal-Mart and the 
United States Department of Defence have published 
requirements that their vendors placed RFID tags on all 
shipments to improve supply chain management. 



10. Other RFID Uses 

NADRA (National Database and Registration Authority) in 
Pakistan has developed an RFID-based driving license that has 
bears the license holders personal information and stores data 
regarding traffic violations, tickets issued and outstanding 
penalties. The license cards are designed so that driving rights 
can be evoked electronically in case of serious violations. 
Sensors such as seismic sensors may be read using RFID 
transceivers, greatly simplifying remote data collection. In 
august 2004, the Ohio Department of Rehabilitation and 
Correction (ODRH) approved a $415000 contract to evaluate 
the personal tracking technology of Alanco Technologies. 
Inmates will wear wristwatch- sized transmitters that can detect 
attempted removal and alert prison computers. Facilities in 
Michigan, California and Illinois already employ the 
technology. RFID in designed by Vita Craft, is an automatic 
cooking device that has three different sized pans, a portable 
induction heater and recipe cards. Each pan is embedded with 
a RFID tag that monitors the food 16 times per second while a 
MI tag in the handle of the pans transmits signals to the 
induction heater to adjust the temperature. 

B. Security issues 

Some problems with RFID are reported during the use. RFID 

problems can be divided into several categories: 

■ Technical problems with RFID 

■ Privacy and ethics problems with RFID. 

Technical problems with RFID 

RFID has been implemented in different ways by 
different manufacturers; global standards are still being 
worked on. It should be noted that some RFID devices are 
never meant to leave their network (as in the case of RFID 
tags used for inventory control within a company). This can 
cause problems for companies. Consumers may also have 
problems with RFID standards. For example, ExxonMobil's 
Speed Pass system is a proprietary RFID system; if another 
company wanted to use the convenient Speed Pass (say, at the 
drive-in window of your favourite fast food restaurant) they 
would have to pay to access it - an unlikely scenario. On the 
other hand, if every company had their own "Speed Pass" 
system, a consumer would need to carry many different 
devices with them. 

An RFID system can utilize a few standards. The problem 
has been that there is no one universally accepted standard. 
Competing standards have been one of the more difficult 
issues for RFID, and as a result, most RFID applications have 
been closed systems. Standards and specifications may be set 
at the international, national, industry or trade association 
level, and individual organizations may term their own 
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specifications as standard. Many industry standards and 
specifications set by individual organizations are based on 
international standards to make implementation and support 
easier and to provide a wider choice of available products. 
Standards can be applied to include the format and content of 
the codes placed on the tags, the protocols and frequencies that 
will be used by the tags and readers to transmit the data, the 
security and tamper-resistance 52 of tags on packaging and 
freight containers, and applications use. The two largest 
drivers for RFID today are Wal-Mart and the Department of 
Defense (DOD). Both have issued mandates for their top 
suppliers to use RFID technology when shipping products to 
their distribution centers. They are both looking to accomplish 
the same thing, but have a slightly different long-term outlook 
[10]. 

The ISO (International Standards Organization) and the 
EPC (Electronic Product Code) Global have both been leading 
figures in this debate. The ISO has their 18000 standard and 
the EPC Global Center has introduced the EPC standard. Wal- 
Mart has decided to use the EPC standard, where the DOD 
wants to use the EPC for general purposes, but use the ISO 
standard for air interface. This is putting a lot of pressure on 
the ISO and EPC to come to some kind of an agreement. EPC 
standard for air interface is not compatible with the ISO 18000 
UHF (Part 6) standard. Both, the EPC and ISO 18000 (Part6) 
standards, deal with the tracking of merchandiser through the 
supply chain. This is WalMarts and the Department of 
Defenses primary focus at this time. The ISO 18000 (Part 6) 
standard only deals with air interface protocols, whereas the 
EPC standard also includes data structure. The desire is for 
these two protocols not to be mutually exclusive. There are 
several evolutions to the EPC standard. Class 1 -Generation 1 
is the current version of EPC. It is not backward compatible 
with Class 0. Generation 2 was hoped to be backward 
compatible with Class but merging with the ISO 18000 
standard will be difficult, if not impossible. Wal-Mart has said 
it will support both Class and 1 but wants to settle on Class 1 
Generation 2 when it is finalized. The EPC standard was 
originally developed for carton and pallet tracking within the 
supply chain. 

Privacy and ethics problems with RFID 

The following problems with RFID tags and readers have 
been reported. 

■ The contents of an RFID tag can be read after the 
item leaves the supply chain. 

■ An RFID tag cannot tell the difference between 
one reader and another. RFID scanners are very portable; 
RFID tags can be read from a distance, from a few inches to a 
few yards. This allows anyone to see the contents of your 
purse or pocket as you walk down the street. Some tags can be 
turned off when the item has left the supply chain. 

■ RFID tags are difficult to remove RFID tags are 
difficult to for consumers to remove; some are very small (less 
than a half-millimetre square, and as thin as a sheet of paper) - 
others may be hidden or embedded inside a product where 
consumers cannot see them. New technologies allow RFID 
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tags to be "printed" right on a product and may not be 
removable at all. 

■ RFID tags can be read without your knowledge 
Since the tags can be read without being swiped or obviously 
scanned (as is the case with magnetic strips or barcodes), 
anyone with an RFID tag reader can read the tags embedded in 
your clothes and other consumer products without your 
knowledge. For example, you could be scanned before you 
enter the store, just to see what you are carrying. You might 
then be approached by a clerk who knows what you have in 
your backpack or purse, and can suggest accessories or other 
items. 

■ RFID tags can be read a greater distances with a 
high-gain antenna For various reasons, RFID reader/tag 
systems are designed so that distance between the tag and the 
reader is kept to a minimum (see the material on tag collision 
above). However, a high-gain antenna can be used to read the 
tags from much further away, leading to privacy problems. 

■ RFID tags with unique serial numbers could be 
linked to an individual credit card number. At present, the 
Universal Product Code (UPC) implemented with barcodes 
allows each product sold in a store to have a unique number 
that identifies that product. Work is proceeding on a global 
system of product identification that would allow each 
individual item to have its own number. When the item is 
scanned for purchase and is paid for, the RFID tag number for 
a particular item can be associated with a credit card number. 



Authentication 

When authentication is carried out, the identity of a 
person or a program is checked. Then, on that basis, 
authorization takes place, i.e. rights, such as the right of access 
to data, are granted. In the case of RFID systems, it is 
particularly important for tags to be authenticated by the 
reader and vice-versa. In addition, readers must also 
authenticate themselves to the backend, but in this case there 
are no RFID- specific security problems. Checking the Identity 
of the tag When the RFID system detects a tag, it must check 
its identity in order to ascertain if the tag has the right to be 
part of the system at all. 

A worldwide and unambiguous regulation for issuing ID 
numbers, as proposed, for example, in the form of the 
Electronic Product Code (EPC), offers a certain amount of 
protection from falsified tags. At the very least, the appearance 
of numbers that were never issued or of duplicates (cloning) 
can be recognized in certain applications. In addition, 
authentication may take place via the challenge-response 
system, in which the reader sends a random number or a time 
stamp to the tag (challenge) which the tag returns in encrypted 
form to the reader (response). The key used in this case is a 
jointly known secret by means of which the tag proves its 
identity. The decisive element in this procedure is the fact that 
the key itself is never transmitted and that a different random 
number is used for every challenge. As a result, the reader 
cannot be deceived by the communication being recorded and 
replayed (replay attack). This unilateral authentication 
procedure is defined as a "symmetric-key two-pass unilateral 
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authentication protocol" in ISO Standard 9798. An attacker 
would have to get hold of the key which is stored both on the 
tag and in the back end of the RFID system. In order to do so, 
it would be necessary to decode the response data that were 
transmitted in encrypted form, which is a very complex if not 
almost impossible task, depending on the length of the key. In 
principle, the key could also be read by physical means from 
the storage cells of the chip, but this would require very 
complicated laboratory methods, such as the "Focused Ion 
Beam" (FIB) technique. In this procedure, an ion beam 
removes very thin layers (a few layers of atoms) in separate 
steps so that the contents can be analyzed microscopically. 

C. Data protection and privacy 

The progressive implementation of RFID systems is being 
keenly followed by the public and mass media and is a topic of 
controversial discussion. From a social point of view, 
guarantees of privacy and various aspects of data protection 
play an ever-increasing role in this controversy (catch- words: 
naked customer or naked citizen). Civil rights organizations 
have published a common position paper on the use of RFID 
and the associated risks posed to data privacy. The signatory 
organizations acknowledged that there can be justified 
interests in the use of RFID on the part of business but, in light 
of the considerable risks involved, they called for dealers and 
manufacturers to observe a voluntary moratorium on the use 
of RFID for consumer goods until all risks were reviewed in a 
comprehensive technology assessment that would propose 
possible counter strategies. 

Anyone with an appropriately equipped scanner and close 
access to the RFID device can activate it and read its contents. 
Obviously, some concerns are greater than others. If someone 
walks by your bag of books from the bookstore with a 13.56 
MHz "sniffer" with an RF field that will activate the RFID 
devices in the books you bought, that person can get a 
complete list of what you just bought. That is certainly an 
invasion of your privacy, but it could be worse. Another 
scenario involves a military situation in which the other side 
scans vehicles going by, looking for tags that are associated 
with items that only high-ranking officers can have, and 
targeting accordingly. Companies are more concerned with the 
increasing use of RFID devices in company badges. 

Along with privacy, consumers want a complex and 
constantly shifting mix of low prices, convenience, 
customization, quality, customer service, and other 
characteristics in their goods and services. Radio frequency 
identification technology will help producers, marketers, and 
retailers better understand and serve the mix of interests 
consumers have. The components that go into RFID readers 
and tags are simple radio communications, but their smaller 
size and broad deployment enhance the power of the 
technology and raise concerns about the privacy effects of 
RFID deployment. These concerns are often premised on 
unlikely assumptions about where the technology will go and 
how it will be used. Any inclination to abuse RFID technology 
will be hemmed in by a variety of social forces, economic 
forces being one of the most significant. The typical RFID tag 
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in the consumer goods environment will be cheap, dumb, and 
not good for much more than tracking inventory. Consumers, 
as economic actors, have substantial power to dictate in the 
give and take of the market how RFID will be used. They will 
likely demand tags linking to their identities in certain 
applications such as consumer electronics but may object to 
the presence of RFID tags in other situations. They may 
demand peel-off tags, or assurances about what a particular tag 
is doing. In many instances, they will be indifferent, and 
rationally so. Regulators, think-tank analysts, and activists 
should not attempt to dictate RFID policy before real 
experience has been gained and must not set up moratorium on 
RFID deployment. 

D. Comparison with Barcode 

Advantages of RFID Versus Barcodes RFID tags and 
barcodes both carry information about products. However, 
there are important differences between these two 
technologies. Barcode readers require a direct line of sight to 
the printed barcode; RFID readers do not require a direct line 
of sight to either active RFID tags or passive RFID tags. RFID 
tags can be read at much greater distances; an RFID reader can 
pull information from a tag at distances up to 300 feet. The 
range to read a barcode is much less, typically no more than 
fifteen feet. RFID readers can interrogate, or read, RFID tags 
much faster; read rates of forty or more tags per second are 
possible. Reading barcodes is much more time-consuming; 
due to the fact that a direct line of sight is required, if the items 
are not properly oriented to the reader it may take seconds to 
read an individual tag. Barcode readers usually take a half- 
second or more to successfully complete a read. 

Line of sight requirements also limit the ruggedness of 
barcodes as well as the reusability of barcodes. Since line of 
sight is required for barcodes, the printed barcode must be 
exposed on the outside of the product, where it is subject to 
greater wear and tear. RFID tags are typically more rugged, 
since the electronic components are better protected in a 
plastic cover. RFID tags can also be implanted within the 
product itself, guaranteeing greater ruggedness and reusability. 
Barcodes have no read/write capability; that is, you cannot add 
to the information written on a printed barcode. RFID tags, 
however, can be read/ write devices; the RFID reader can 
communicate with the tag, and alter as much of the 
information as the tag design will allow. RFID tags are 
typically more expensive than barcodes, in some cases, much 
more so [12]. RFID and barcodes are similar in that they are 
both data collection technologies. However, they differ 
significantly in many areas. Although this comparison 
primarily focuses on the advantages of RFID over barcodes, 
RFID will probably not completely replace barcode 
technology. Barcodes offer some advantages over RFID, most 
notably the low cost. A tabular comparison for RFID and 
Barcode is given in Table I- 



TABLE I. 


COMPARISON OF RFID AND BARCODE 




RFID 


Barcode 


Read range 


Passive RFID: 


Several inches up 
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Up to 40 feet (fixed 


to 




readers). 


several feet. 




Up to 20 feet (handheld 






readers) 






Active RFID: 






Up to 100's 






of feet or more. 




Line of sight 


Not required 
(in most cases). 


Required. 


Type of 


Can uniquely identify 


Can typically only 


identification 


each item/asset tagged. 


identify the type of 
item (UPC Code) 
but not uniquely. 


Read/Write 


Many RFID tags are 
Read/Write. 


Read only. 


Read rate 


10's, 100's or 1000's 
simultaneously. 


Only one at a time. 


Technology 


RF (Radio Frequency). 


Optical (Laser). 


Interference 


Like the TSA 


Obstructed 




(Transportation 


barcodes cannot be 




Security 


read (dirt covering 




Administration), some 


barcode, torn 




RFID frequencies do 


Barcode etc.). 




not like Metal and 






Liquids. They can 






cause interference with 






certain RF Frequencies. 




Automation 


Most "fixed" readers do 


Most barcode 




not require human 


scanners require a 




involvement to collect 


human to operate 




data (automated). 


(labour intensive). 



V. Thought of next generation 

Some vendors have been combining RFID tags with sensors of 
different kinds. This would allow the tag to report not simply 
the same information over and over, but identifying 
information along with current data picked up by the sensor. 
For example, an RFID tag attached to a leg of lamb could 
report on the temperature readings of the past 24 hours, to 
ensure that the meat was properly kept cool. Over time, the 
proportion of "scan-it-yourself" aisles in retail stores will 
increase. Eventually, we may wind up with stores that have 
mostly "scan-it-yourself" aisles and only a few checkout 
stations for people who are disabled or unwilling [12]. 

RFID tags come in a wide variety of shapes and sizes; 
they may be encased in a variety of materials: Animal tracking 
tags, inserted beneath the skin, can be rice-sized; Tags can be 
screw- shaped to identify trees or wooden items; Credit-card 
shaped for use in access applications. The antitheft hard plastic 
tags attached to merchandiser in stores are also RFID tags. 
Heavy-duty 120 by 100 by 50 millimetre rectangular 
transponders are used to track shipping containers, or heavy 
machinery, trucks, and railroad cars. Many musical 
instruments are stolen every year. For example, custom-built 
or vintage guitars are worth as much as $50,000 each. Snagg, a 
California company specializing in RFID microchips for 



International Journal of Computer Science and Information Security, 

Vol 8, No. 8, November 2010 
instruments, has embedded tiny chips in 30,000 Fender guitars 
already [23]. 

The smallest tags that will likely be used for consumer 
items do not have enough computing power to do data 
encryption to protect your privacy. The most they can do is 
PIN-style or password-based protection [20]. Civil liberties 
groups (among others) have become increasingly concerned 
about the use of RFIDs to track the movements of individuals. 
For example, passports will soon be required to contain some 
sort of RFID device to speed border crossings. Scanners 
placed throughout an airport, for example, could track the 
location of every passport over time, from the moment you left 
the parking lot to the moment you got on your plane. In June, 
the Japanese government passed a draft RFID Privacy 
Guideline that stated the following: " Indication that RFID 
tags exist "Consumers right of choice regarding reading tags 
"Sharing information about social benefits of RFID, etc. 
"Issues on linking information on tags and databases that store 
privacy information." Restrictions of information gathering 
and uses when private information is stored on tags " Assuring 
accuracy of information when private information is stored on 
tags "Information administrators should be encouraged" 
Information sharing and explanation for consumers. 

There was a recent report revealing clandestine tests at a 
Wal-Mart store where RFID tags were inserted in packages of 
lipstick, with scanners hidden on nearby shelves. When a 
customer picked up a lipstick and put it in her cart, the 
movement of the tag was registered by the scanners, which 
triggered surveillance cameras. This allowed researchers 750 
miles away to watch those consumers as they walked through 
the store, looking for related items. Contact less Credit Card 
Advantages Credit card companies are claiming the following 
advantages for contact less credit cards: The card is faster to 
use. To make a purchase, the card owner just waves his card 
over the RFID reader, waits for the acceptance indicator - and 
goes on his way. American Express, Visa and Master card 
have all agreed to waive the signature requirement for contact 
less credit card transactions under $25. If we want to look at 
the numbers, here is where this technology is taking us in our 
need for speed (average transaction speeds): 

1. Contact less credit card transaction: 15 seconds 

2. Magnetic strip card transaction: 25 seconds 

3. Cash transaction: 34 seconds 

The contact less cards use highly secure data transmission 
standards. Contact less cards make use of the most secure 
encryption standards practical with current technology. 128-bit 
and triple DES encryption make it nearly impossible for 
thieves to steal your data. 

Contact less Credit Card Disadvantages 

Contact less cards are more exposed than regular credit 
cards. If you want to keep your credit card secure, you could 
keep it safely in an enclosed wallet or purse; thieves would 
have absolutely no way to even know if you have a credit 
card. However, a thief armed with a suitable reader, within a 
few feet of you, would be able to interrogate all of the cards in 
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your wallet or purse without your knowledge. Also, a regular 
credit card transaction is fairly secure; the magnetic strip is 
swiped at very close range (less than a millimetre). However, a 
thief with a suitable reader could monitor your contactless 
card transaction while standing at the counter with you, or just 
behind you. These concerns have, of course, been carefully 
noted by credit card companies. The RFID chip in the contact 
less credit card responds to the merchant reader with a unique 
number used for that transaction only; it does not simply 
transmit the consumer's account number. This number is also 
encrypted. It is easier to spend. Studies have demonstrated that 
consumers will be more likely to spend, and will spend more 
frequently, with contact less credit cards [12]. 

VI. Conclusion 

Finally I would like to conclude by mentioning some future 
challenges regarding RFID technology. Challenges will arise 
from the flexibility of changes in tag ownership. Today, 
domain names, for example, do not change hands very often; 
the DNS can involve human intermediated access-control. 
Another important aspect of RFID security is that of user 
perception of security and privacy in RFID systems. As users 
cannot see RF emissions, they form their impressions based on 
physical cues and industry explanations. RFID will come to 
secure ever more varied forms of physical access and logical 
access. Every technology has some advantages and 
disadvantages; but RFID technology has so far showed a lot of 
potential to be a topic on which intense research can be carried 
upon. 
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Abstract — Researchers introduce in this paper, an efficient fuzzy 
time series forecasting model based on fuzzy clustering to handle 
forecasting problems and improving forecasting accuracy. Each 
value (observation) is represented by a fuzzy set. The transition 
between consecutive values is taken into account in order to 
model the time series data. Proposed model employed eight main 
steps in time-invariant fuzzy time-series and time-variant fuzzy 
time series models to increase the performance of the proposed 
fuzzy time series model. The method of FCMI is integrated in the 
processes of fuzzy time series to partition datasets. The proposed 
model has been implemented to forecast the world production of 
iron and steel and the enrollments of the University of Alabama. 
The proposed model provide higher accuracy in forecasting. Our 
results show that this approach can lead to satisfactory 
performance for fuzzy time series 



Keywords- forecasting; fuzzy Clustering; fuzzy time series; iron. 



I. 



Introduction 



Traditional forecasting methods can deal with many 
forecasting cases, but they cannot solve forecasting problems in 
which the historical data are linguistic values. Song and 
Chissom [12] presented the concept of fuzzy time series based 
on the historical enrollments of the University of Alabama. 
They presented the time-invariant fuzzy time series model and 
the time-variant fuzzy time series model based on the fuzzy set 
theory for forecasting the enrollments of the University of 
Alabama. 

The fuzzy forecasting methods can forecast the data with 
linguistic values. Fuzzy time series do not need to turn a non- 
stationary series into a stationary series and do not require more 
historical data along with some assumptions like normality 
postulates. Although fuzzy forecasting methods are suitable for 
incomplete data situations, their performance is not always 
satisfactory [9,11]. 

Huarng [6] proposed heuristic models; by integrating 
problem-specific heuristic knowledge to improve forecasting. 
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Tsaur, et al [14] proposed an analytical approach to find 
the steady state of fuzzy relation matrix to revise the logic 
forecasting process. Based on the concept of fuzziness in 
Information Theory, the concept of entropy is applied to 
measure the degrees of fuzziness when a time-invariant relation 
matrix is derived. In order to show the forecasting 
performance, the best fitted regression equations are applied to 
compare with the proposed method. 

Yu [15] proposed weighted models to tackle two issues in 
fuzzy time series forecasting; namely, recurrence and 
weighting. Weighted fuzzy time series models appear quite 
similar to the weight functions in local regression models; 
however, both are different. The local regression models focus 
on fitting using a small portion of the data, while the fuzzy 
relationships in weighted fuzzy time series models are 
established using the possible data from the whole of the 
database. 

Jilani and Burney [7] presented two new multivariate 
fuzzy time series forecasting methods. These methods assume 
m-factors with one main factor of interest. Stochastic fuzzy 
dependence of order k is assumed to define general methods of 
multivariate fuzzy time series forecasting and control. 

Cheng et al [4] proposed a novel multiple-attribute fuzzy 
time series method based on fuzzy clustering. The methods of 
fuzzy clustering were integrated in the processes of fuzzy time 
series to partition datasets objectively and enable processing of 
multiple attributes. 

Abd Elaal et al [1-2] proposed a novel forecasting fuzzy 
time series model depend on fuzzy clustering for improving 
forecasting accuracy. Kai et al [8] proposed a novel forecasting 
model for fuzzy time series using K-means clustering 
algorithm for forecasting. 

In this paper, researchers propose an efficient fuzzy time 
series forecasting model based on fuzzy clustering to handle 
forecasting problems and improving forecasting accuracy. Each 
value (observation) is represented by a fuzzy set. The transition 
between consecutive values is taken into account in order to 
model the time series data. 
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II. Related works 



In this section, two related works including: fuzzy clustering 
and fuzzy time series. 

A. Fuzzy clustering (FCMI) 

Fuzzy c-means (FCM) is a method of clustering which 
allows one piece of data to belong to two or more clusters. 
Fuzzy C-Mean Iterative assume that: the existence of pattern 
space X={x 1? x 2 ,..., x m ) and c fuzzy clusters, whose centers 
have initial values y 10 , y2o,--- 5 y c o- Every iteration the 
membership function values updated and the cluster centers 
also. The process terminates when the difference between two 
consecutive clusters centers do not exceed a given tolerance 
[5]. 
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defined by the fuzzy set f x (t). If F(t) is a collection of fi(t), f 2 (t). 
. . then F(t) is defined as a fuzzy time-series on Y (t) (t = . . . , 
0,1,2,...). 



Ak) 



.,(*) 



(1) 



Definition 2. (FTSRs) If there exists a fuzzy logical 
relationship R(t - 1, t), such that F(t) = F(t - 1) x R(t - 1, t), 
where "x" represents an operation, then F(t) is said to be 
induced by F(t - 1). The logical relationship between F(t) and 
F(t- 1) is F(t-l)->F(t). 

Definition 3. (FLR) suppose F(t - 1) = A, and F(t) = Aj . The 
relationship between two consecutive observations, F(t) and F(t 
- 1), referred to as a fuzzy logical relationship, can be denoted 
by Ai -> Aj , where Ai is called the Left-Hand Side (LHS) and 
Aj the Right-Hand Side (RHS) of the FLR. 

Definition 4. (FLRG) All fuzzy logical relationships in the 
training dataset can be grouped together into different fuzzy 
logical relationship groups according to the same Left-Hand 
Sides of the fuzzy logical relationship. For example, there are 
two fuzzy logical relationships with the same Left-Hand Side 



Fuzzy clustering is carried out through an iterative optimization (A . ): A .^ Aji and A . ^ A ^ These two fuzzy logical 



of the objective function iJ , with the update of membership 



U 



u and the cluster centers ^ l by: 



,(*) 



( jW\ 



v o J 



2l(j3-\) 



(2) 



relationships can be grouped into a fuzzy logical relationship 
group Ai-> A^ A j2 . 

Definition 5. (IFTS & VFTS) Assume that F(t) is a fuzzy time- 
series and F(t) is caused by F(t - 1) only, and F(t) = F(t - 1) x 
R(t - 1, t). For any t, if R(t - 1, t) is independent oft, then F(t) 
is named a time-invariant fuzzy time-series, otherwise a time- 
variant fuzzy time-series. 



y; 



E u 

7 = 1 



(k) x 






(3) 



a) Song and Chissom model 

Song and Chissom employed five main steps in time- 
invariant fuzzy time-series and time-variant fuzzy time series 
models as follows: 



This iteration will stop when 









111 



(4) 



B. Fuzzy time series 

Song and Chissom [13] presented the concept of fuzzy 
time series based on the historical enrollments of the University 
of Alabama. Fuzzy time series used to handle forecasting 
problems. They presented the time-invariant fuzzy time series 
model and the time-variant fuzzy time series model based on 
the fuzzy set theory for forecasting the enrollments of the 
University of Alabama. The definitions and processes of the 
fuzzy time-series presented by Song and Chissom are described 
as follows [6,12]. 

Definition 1. (FTS) Assume Y (t) (t = . . 0, 1, 2, . . .) is a subset 
of a real numbers. Let Y (t) be the universe of discourse 



Step 1: Define the universe of discourse U. Define the universe 
of discourse for the observations. According to the issue 
domain, the universe of discourse for observations is defined 
as, 

U=[D mm -D 1 ,D max + D 2 ] (5) 

where, D mm is the minimum value, 
D max is the maximum value, 
Di, D 2 is the positive real numbers. 

Step 2: Partition universal of discourse U into equal intervals. 

Step 3: Define the linguistic terms. Each linguistic observation, 
A k can be defined by the intervals U!,u 2 ,...,u n , as follows: 



A,. 



1 0.5 

— + — 

Ul u 2 

0.5 ■ 1 ■ 0.5 

u k -i u k u k+1 

0.5 1 

u n _i u n 



k=l 

2< k<n-l 

k=n 



(6) 
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Step 4: Fuzzify the historical data. Each historical data can be 
fuzzified into a fuzzy set. 



A. Evaluating of the proposed model 



Step 5: Build fuzzy logic relationships. Build fuzzy logic 
relationships. Two consecutive fuzzy sets Ai(t-l)and Aj(t) can 
be established into a single FLR as Ai~> Aj. 



III. Proposed model 

In this section we proposed an efficient fuzzy time series 
forecasting model based on fuzzy clustering to handle 
forecasting problems and improving forecasting accuracy. 
Most researchers have been taken the same way according to 
processes of the fuzzy time-series, which are presented by Song 
and Chissom, but we introduce a novel model based on fuzzy 
clustering to determine the membership values not as Song and 
Chissom model, and to increase the performance. Proposed 
model employed eight main steps in time-invariant fuzzy time- 
series and time-variant fuzzy time series models as follows: 

Step 1: Cluster data into c clusters: Apply fuzzy clustering 
on a time series Y(t) with n observation to cluster this time 
series into c (2 < c < n) clusters. FCMI is used because it is the 
most popular one and well known in fuzzy clustering field. 

Step 2: Determine membership values for each cluster: In 

this step, membership values is determining after doing fuzzy 
cluster. The proposed model selected the maximum 
membership grade of each value for each cluster which it 
belong to. 

Step 3: Rank each cluster: Proposed model ranking clusters 
by the center of each cluster, where first cluster has the 
minimum center, and last cluster has the maximum center. 

Step 4: Define the universe of discourse U: In this step, the 
proposed model defines the universe of discourse as Song and 
Chissom were defined it as in (5). 

Step 5: Partition universal of discourse U into equal 
intervals: According to this step, the proposed model, partition 
the universe of discourse into c intervals. 

Step 6: Fuzzify the historical data: In this step, proposed 
model fuzzufy historical data, where the proposed model 
determine the best fuzzy cluster to each actual data 

Step 7: Build fuzzy logic relationships: Proposed model in 
this step build fuzzy logic relationship as definition 3. if F(t-l) 
= Ai and F(t) = Aj then the relationship between two 
consecutive observations: Ai -> Aj 

Step 8: Calculate forecasting outputs: The forecasting value 
for each cluster is calculated by proposed model as: 



fore caste (A.) 



df x x X x +df x x Xj+ ... +df m x X n 

W~fc 

7=1 J 



(7) 



Where dfj is the membership grade, 
Xj is the actual value. 



To evaluating the performance of the proposed model, the 
researchers compare the forecasting values of enrollments of 
the University of Alabama with some famous models such as 
Jilani and Burney [7], Tsaur and Yang [14], Yu [15], Kai et al 
[8], and Cheng, etal[4]. 

The forecasting accuracy is compared by using (NRMSE) 
Normalized Root Mean Square Error. NRMSE, in statistic is 
the square root of the sum of the squared deviations between 
actual and predicted values divided by the sum of the square of 
actual values. 



^ {actual . - predict . ) 2 



NRMSE -- 



^ (actual t ) 2 



(8) 



In this study, to evaluate the forecasting accuracy of the 
proposed model, the researchers use the enrollments of the 
University of Alabama as the forecasting target in the existing 
forecasting models. 



Based on the enrollments of the University of Alabama 
from 1971 to 1992, we can get the universe of discourse 
U=[13055,19337], partition U into 7 equal intervals, D!=13, 
and D 2 =55. Hence, the intervals are ui; u 2 ; u 3 ; u 4 ; u 5 ; u 6 ; u 7 ; 
where :- 

U!=[13024.00, 13933.71] 

u 2 =[13933.71, 14843.43], 

u 3 =[14843.43, 15753.14], 

u 4 =[15753.14, 16662.86], 

u 5 =[16662.86, 17572.57], 

u 6 =[17572.57, 18482.29], 

u 7 =[18482.29, 19392.00], 

Table I lists the enrollment of the University of Alabama 
from 1971 to 1992, and membership grades of enrollments for 
each linguistic. Define the fuzzy set Ai using the linguistic 
variable "Enrollments of the University of Alabama", let A x = 
(very very few), A 2 = (very few), A 3 = (few), A 4 = (moderate), 
A 5 = (many), A 6 = (many many), A 7 = (too many). The 
proposed model selected the maximum membership grade for 
each cluster, the forecasting value for each cluster calculating 
as in (7): 



fore caste (A,) 



forecaste (A~) 



lx(1972) 
1 



13563 



0.8 x (1984) 
0.8 



15145 
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forecaste (A~ ) ; 



lx(1975) +lx(1982) 



: 15446 



forecaste (^ 4 ) = iiLi^Z?I = 15861 



forecaste (A~) : 



forecaste (A.) ■- 



lx(1979) 
1 

lx(1988) 
1 



16807 



18150 



forecaste (A q ) = lx (* 989 ) = 18970 



TABLE I 


Data of enrollments of the university of Alabama 






AND MEMBERSHIP GRADES. 








Year 


Actual 


Ai 


A 2 


A 3 


A 4 


A 5 


A 6 


A 7 


1971 


e » 


0.8 


0.1 

















1972 


13563 


1 




















1973 


13867 


0.9 


0.1 

















1974 


14696 


0.1 


0.7 


0.2 


0.1 











1975 


15460 








1 














1976 


15311 





0.1 


0.9 














1977 


15603 





0.1 


0.6 


0.3 











1978 


15861 











1 











1979 


16807 














1 








1980 


16919 














0.9 








1981 


16388 








0.1 


0.3 


0.6 








1982 


15433 








1 














1983 


15497 








0.9 


0.1 











1984 


15145 





0.8 


0.2 














1985 


15163 





0.7 


0.2 














1986 


15984 











0.9 











1987 


16859 














1 








1988 


18150 

















1 





1989 


18970 




















1 


1990 


19328 




















0.9 


1991 


19337 




















0.9 


1992 


18876 

















0.1 


0.9 













Years 


Values 




• 






1971 


13055 


A{1) 


13563 


1972 


13563 


A(l) 


13563 


No. Cluster 7 


1973 


13867 


A(l) 


13563 


N Iteration 1 100 
Tolerance 0.0005 


19"4 


14696 


A(2) 


15145 


1975 


15460 


A(3) 


15446 


1976 


15311 


A(3) 


15446 


1977 


15603 


A(3) 


15446 


1578 


15861 


A(4) 


15861 


Beta I 1.999 




16807 


A(5) 


16833 






1980 


16919 


A(5) 


16833 


19S1 


16388 


A(4) 


15861 


1982 


15433 


A(3) 


15446 


1983 


15497 


A(3) 


15446 


1984 


15145 


A(3) 


15446 


1985 


15163 


A(3) 


15446 


1986 


15984 


A(4) 


15861 


1987 


16859 


A(5) 


16833 














New 


Model 




Proposed 


Exit 



















TABLE II. Data enrollments the university of Alabama, 

LINGUISTIC VALUES, AND FORECASTED VALUES 

Forecasted 

13563 
13563 
13563 
15145 
15446 
15446 
15446 
15861 
16833 
16833 
15861 
15446 
15446 
15446 
15446 
15861 
16833 
18150 
18970 
18970 
18970 
18970 



Comparisons of the forecasting results of different models 



^ears 


Enrollments 


Lingui 


1971 


13055 


Ai 


1972 


13563 


A : 


1973 


13867 


A : 


1974 


14696 


A 2 


1975 


15460 


A 3 


1976 


15311 


A 3 


1977 


15603 


A 3 


1978 


15861 


A 4 


1979 


16807 


A 5 


1980 


16919 


A 5 


1981 


16388 


A 4 


1982 


15433 


A 3 


1983 


15497 


A 3 


1984 


15145 


A 3 


1985 


15163 


A 3 


1986 


15984 


A 4 


1987 


16859 


As 


1988 


18150 


A 6 


1989 


18970 


A 7 


1990 


19328 


A 7 


1991 


19337 


A 7 


1992 


18876 


A 7 




Years 



Figure 2. Forecasting results curve of enrollments of the university of 
Alabama 



The forecasting value for year 1971 is 13563 while the 
actual value was 13055. Fig.l and Table II show linguistic 
terms and forecasting values deduced by proposed model. 



Figure 1 . Forecasting enrollments of the University of Alabama by the 
proposed model 
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NRMSE for different models 



(IJCSIS) International Journal of Computer Science and Information Security, 

Vol 8, No. 8, 2010 
also shows that, the proposed model can further improve the 
forecasting results than the other model. 



Fig. 3 shows the comparisons among the existing models 
by using NRMSE, where Jilani and Burney [7] model has 0.02, 
Tsaur and Yang [14] model has 0.025, Yu [15] model has 
0.026, Kai et al [8] model has 0.024, Cheng, et al [4] model has 
0.028 and proposed model has 0.015. 



ni2008 Tsaur 2005 



ai 2010 Proposed 



Figure 3. NRMSE-chart for the existing models and the proposed model 



The line-chart comparison in Fig. 2 shows that the proposed 
model has higher accuracy than the other models. And the 
empirical comparison among the existing models in Table III 



TABLE III. 



Forecasting enrollments of the university of Alabama 



Year 



Actual 
enrollments 



Tsaur and 
Yang 
(2005) 



Yu 

(2005) 



Jilani and 
Burney 
(2008) 



Cheng et al 
(2008) 



Kai et al 
(2010) 



Proposed 



1971 


13055 


13934 


13934 


13769 






13563 


1972 


13563 


13934 


13934 


13769 


14242 


13997 


13563 


1973 


13867 


13934 


13934 


13769 


14242 


13997 


13563 


1974 


14696 


15298 


15298 


14360 


14242 


13997 


15145 


1975 


15460 


15753 


15623 


15271 


15474.3 


15461.2 


15446 


1976 


15311 


15753 


15623 


15271 


15474.3 


15461.2 


15446 


1977 


15603 


15753 


15623 


15271 


15474.3 


15461.2 


15446 


1978 


15861 


16208 


16511 


16182 


15474.3 


15461.2 


15861 


1979 


16807 


17118 


17269 


17094 


16146.5 


16861.7 


16833 


1980 


16919 


17118 


17269 


17094 


16988.3 


17394 


16833 


1981 


16388 


16208 


16511 


16182 


16988.3 


17394 


15861 


1982 


15433 


15753 


15623 


15271 


16146.5 


15461 


15446 


1983 


15497 


15753 


15623 


15271 


15474.3 


15461.2 


15446 


1984 


15145 


15753 


15623 


15271 


15474.3 


15461.2 


15446 


1985 


15163 


15753 


15623 


15271 


15474.3 


15461.5 


15446 


1986 


15984 


16208 


16511 


16182 


15474.3 


15461.5 


15861 


1987 


16859 


17118 


17269 


17094 


16146.5 


16861.7 


16833 


1988 


18150 


18937 


18937 


18004 


16988.3 


17394 


18150 


1989 


18970 


18937 


18937 


18624 


19144 


18932.2 


18970 


1990 


19328 


18937 


18937 


18624 


19144 


18932.2 


18970 


1991 


19337 


18937 


18937 


18624 


19144 


18932.2 


18970 


1992 


18876 


18937 


18937 


18624 


19144 


18932.2 


18970 




NRMSE 


0.025 


0.026 


0.02 


0.028 


0.024 


0.015 
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TABLE IV. Data of the world production of iron and steel, and 

MEMBERSHIP GRADES. 



Based on the data of the iron and steel production witch are 
provided by the International Iron and Steel Institute in 
Brussels, Belgium, and publications of the U.S. geological 
survey from 1975 to 2008 (production values in thousand 
metric tons), we can get the universe of discourse U=[457000, 
954000], partition U into 7 equal intervals, D^OOO, and 
D 2 =7000. Hence, the intervals are ui; u 2 ; u 3 ; u 4 ; u 5 ; u 6 ; u 7 ; 
where :- 



ui=[ 451000.00, 523857.14] 
u 2 =[ 523857.14, 596714.29], 
u 3 =[ 596714.29, 669571.43], 
u 4 =[ 669571.43, 742428.57], 
u 5 =[ 742428.57, 815285.71], 
u 6 =[ 815285.71, 888142.86], 
u 7 =[ 888142.86, 961000.00], 





Y r ears 


Values 


Linguistic 


Forecastsh 


"3 




1975 


479000 


A(l) 


494875 




1976 


498000 


A(l> 


494875 




No. Cluster p 7 


1977 


48 8 000 


A(l> 


494875 




N Iteration 100 


1978 


506000 


A(l) 


494875 




1979 


532000 


A(2) 


537250 






1980 


514000 


A(l) 


494875 










19S1 


502000 


A(l) 


494875 






1982 


457000 


A(l) 


494875 




Beta 1.55 




1 


1983 


463000 


A(l) 


494875 






1984 


495000 


A(l> 


494875 






1985 


499000 


4(1) 


494875 






1986 


495000 


A(l> 


494875 






1987 


509000 


A(l) 


494875 






1988 


539000 


A(2) 


537250 






1989 


546000 


A(2> 


537250 






1990 


531000 


A(2) 


537250 






1991 


509000 


A(l) 


494875 


A 



New 



Model 



Proposed 



Exit 



Figure 4. Forecasting of the world production of iron and steel by the 
proposed model 

Table IV lists the World Production of Iron and Steel from 
1975 to 2008, and membership grades of enrollments for each 
linguistic. Define the fuzzy set Aj using the linguistic variable 
"World Production of Iron and Steel", let A x = (very very few), 
A 2 = (very few), A 3 = (few), A 4 = (moderate), A 5 = (many), A 6 
= (many many), A 7 = (too many). 



Year 


Production 


Ax 


A 2 


A3 


A 4 


A 5 


A 6 


A 7 


1975 


479000 


1 




















1976 


498000 


1 




















1977 


488000 


1 




















1978 


506000 























1979 


532000 





1 

















1980 


514000 























1981 


502000 


1 




















1982 


457000 























1983 


463000 























1984 


495000 


1 




















1985 


499000 


1 




















1986 


495000 


1 




















1987 


509000 























1988 


539000 





1 

















1989 


546000 





1 

















1990 


531000 





1 

















1991 


509000 























1992 


503000 


1 




















1993 


507000 























1994 


516000 























1995 


536000 





1 

















1996 


516000 























1997 


540000 





1 

















1998 


535000 





1 

















1999 


539000 





1 

















2000 


573000 








1 














2001 


585000 








1 














2002 


608000 








1 














2003 


673000 











1 











2004 


720000 











1 











2005 


802000 














1 








2006 


881000 

















1 





2007 


954000 




















1 


2008 


932000 




















1 



Fig. 4 and Table V show linguistic terms and forecasting 
values deduced by proposed model. The forecasting value for 
year 1975 is 494875 while the actual value was 479000 and the 
forecasting value for year 2008 is 943000 while the actual 
value was 932000. 



The proposed model selected the maximum membership 
grade for each cluster, the forecasting value for each cluster 
calculating as in (7): 
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fore caste (A,) 



(IJCSIS) International Journal of Computer Science and Information Security, 

Vol 8, No. 8, 2010 
1 x (1975)+! x (1976)+! x (1977)+! x (1981)+! x (1984)+! x (1985)+! x (1986)+! x (1992) 



, t N lx(1979)+lx(1988)+lx(1989)+lx(1990)+lx(1995)+lx(1997)+lx(1998)+lx(1999) 
jorecaste^Ar. ) = h = 537250 



forecaste (A^ ) 



1 x (2000) +1 x (2001) +1 x (2002) 



588667 



forecaste {A.) 



1 x (2003) +1 x (2004) 



696500 



forecaste (A $ ) = - X ^° 5 ^ = 802000 



forecaste (A^) 



1 x (2006) 

1 



881000 



forecaste (A- ) 



1 x (2007) +1 x (2008) 



943000 



TABLE V. Data of the world production of iron and steel, 

LINGUISTIC VALUES, AND FORECASTED VALUES 



Year 


Production 


Linguistic 


Forecasted 


1975 


479000 


Aj 


494875 


1976 


498000 


Ai 


494875 


1977 


488000 


Ai 


494875 


1978 


506000 


A : 


494875 


1979 


532000 


A 2 


537250 


1980 


514000 


Ai 


494875 


1981 


502000 


Ai 


494875 


1982 


457000 


Ai 


494875 


1983 


463000 


A : 


494875 


1984 


495000 


A : 


494875 


1985 


499000 


A : 


494875 


1986 


495000 


A : 


494875 


1987 


509000 


Ai 


494875 


1988 


539000 


A 2 


537250 


1989 


546000 


A 2 


537250 


1990 


531000 


A 2 


537250 


1991 


509000 


A : 


494875 


1992 


503000 


A : 


494875 


1993 


507000 


A : 


494875 


1994 


516000 


A : 


494875 


1995 


536000 


A 2 


537250 


1996 


516000 


Ai 


494875 


1997 


540000 


A 2 


537250 


1998 


535000 


A 2 


537250 


1999 


539000 


A 2 


537250 


2000 


573000 


A 2 


537250 


2001 


585000 


A 2 


537250 


2002 


608000 


A 3 


588667 


2003 


673000 


A 4 


696500 


2004 


720000 


A 4 


696500 


2005 


802000 


A 5 


802000 


2006 


881000 


A 6 


881000 


2007 


954000 


A 7 


943000 


2008 


932000 


A 7 


943000 



The researchers used famous models: Huarng[6], Tsaur and 
Yang [14], Yu [15], Jilani and Burney [7] to test the proposed 
model by forecasting of the world production of iron and steel 
as in Table VI. 



Comparisons of the forecasting results of different models 



^^% 




-Actual 

■ Huarng 2001 

-Tsaur 2005 

Yu2005 

Jilani 2008 

Proposed 



o5° C# (§? <* c^ 



V V V V 



Figure 5. Forecasting results curve of the world production of iron and steel 



NRMSE for different models 



Yu 2005 
Models 



ilani 2008 Proposed 



Figure 6. NRMSE-chart for the existing models and the proposed 
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The line-chart comparison in Fig. 5 shows that the proposed 
model has higher accuracy than the other models. And the 
empirical comparison among the existing models in Table VI 
also shows that, the proposed model can further improve the 
forecasting results than the other model. 



Fig. 6 shows the comparisons among the existing 
models by using NRMSE, where Huarng[6] model has 0.0496, 
Tsaur and Yang [14] model has 0.0598, Yu [15] model has 
0.0551, Jilani and Burney [7] model has 0.0399, and proposed 
model has 0.0296. 





TABLE VI. Forecasting of the world production of iron and steel 




Year 


Actual 


Huarng 2001 


Tsaur 2005 


Yu 2005 


Jilani 2008 


Proposed 


1975 


479000 


504571 


523857 


510762 


509514 


494875 


1976 


498000 


504571 


523857 


510762 


509514 


494875 


1977 


488000 


504571 


523857 


510762 


509514 


494875 


1978 


506000 


504571 


523857 


510762 


509514 


494875 


1979 


532000 


545714 


560286 


560286 


555508 


537250 


1980 


514000 


504571 


523857 


510762 


509514 


494875 


1981 


502000 


504571 


523857 


510762 


509514 


494875 


1982 


457000 


504571 


523857 


510762 


509514 


494875 


1983 


463000 


504571 


523857 


510762 


509514 


494875 


1984 


495000 


504571 


523857 


510762 


509514 


494875 


1985 


499000 


504571 


523857 


510762 


509514 


494875 


1986 


495000 


504571 


523857 


510762 


509514 


494875 


1987 


509000 


504571 


523857 


510762 


509514 


494875 


1988 


539000 


545714 


560286 


560286 


555508 


537250 


1989 


546000 


545714 


560286 


560286 


555508 


537250 


1990 


531000 


545714 


560286 


560286 


555508 


537250 


1991 


509000 


504571 


523857 


510762 


509514 


494875 


1992 


503000 


504571 


523857 


510762 


509514 


494875 


1993 


507000 


504571 


523857 


510762 


509514 


494875 


1994 


516000 


504571 


523857 


510762 


509514 


494875 


1995 


536000 


545714 


560286 


560286 


555508 


537250 


1996 


516000 


504571 


523857 


510762 


509514 


494875 


1997 


540000 


545714 


560286 


560286 


555508 


537250 


1998 


535000 


545714 


560286 


560286 


555508 


537250 


1999 


539000 


545714 


560286 


560286 


555508 


537250 


2000 


573000 


545714 


560286 


560286 


555508 


537250 


2001 


585000 


545714 


560286 


560286 


555508 


537250 


2002 


608000 


706000 


706000 


706000 


628923 


588667 


2003 


673000 


742429 


742429 


754571 


702221 


696500 


2004 


720000 


742429 


742429 


754571 


702221 


696500 


2005 


802000 


851714 


851714 


851714 


775435 


802000 


2006 


881000 


924571 


924571 


924571 


848587 


881000 


2007 


954000 


924571 


924571 


924571 


898939 


943000 


2008 


932000 


924571 


924571 


924571 


898939 


943000 


NRMSE 


0.0496 


0.0598 


0.0551 


0.0399 


0.0296 
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V. Discussion and conclusion 



The research proposed an efficient fuzzy time series 
forecasting model based on fuzzy clustering with high 
accuracy. The method of FCMI is integrated in the processes of 
fuzzy time series to partition datasets. Experimental results of 
enrollments of the University of Alabama, and the comparison 
between the existing models: Jilani and Burney [7], Tsaur and 
Yang [14], Yu [15], Kai et al [8], and Cheng, et al [4] and the 
proposed model show that, the proposed model can further 
improve the forecasting results than the other models and also 
the experimental results of the world production of iron and 
steel, and the comparison between the existing models: 
Huarng[6], Tsaur and Yang [14], Yu [15], Jilani and Burney [7] 
and the proposed model show that, the proposed model has 
higher accuracy than the other models. 
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Abstract — Human body anthropometric measurement is widely 
used in daily life and has become an indispensable item for people 
and garment manufactures. Two-dimensional image-based 
anthropometric measurement system provides another choice to 
make anthropometric measurement alternative to traditional and 
three-dimension methods. These measurement systems are 
attractive because of their lower cost and easier to use. Although 
these systems have appeared in this type of application, most 
systems require the user to wear as little as possible for reducing 
the errors which come from garments in the measurement, and 
furthermore the measurement equipments are not easy available 
at anywhere and the setup are always complex. This paper 
presents an approach with fewer constraints and more simplified 
operation, and has the performance as good as manual 
measurement. In this approach, the Chinese medicine 
acupuncture theory is used to locate the position which 
measurement concerned and replace the manual marking or 
other feature extraction methods. For circumferences 
measurement, the human body slice model is supplied to 
approach the circumference shapes and used the piecewise Bezier 
curve to approximate the circumference curve. At the final, a 
compensation system of garment thickness is employed to amend 
the measurement data, which are obtained through the direct 
measuring of subject wearing clothes, to ensure the accuracy. 
Through of these methods, the subjects for measurement are not 
required for wearing as little as possible and the results of 
experiments also shows that the approach is quite comparable to 
traditional measurement methods. 

Keywords- anthropometric measurement; Chinese medicine 
acupuncture; garment thickness 

I. Introduction 

A For long time ago, people knew how to use tools to take 
a measurement of human body, stature, lengths, and 
circumferences, and use these anthropometric data for 
manufactory product, garment designed, statistic... etc. 
Nowadays, the anthropologic measurement is more widely 
used than before and become an indispensable item in human 



life. With the rapid development of internet worldwide, the 
consumption pattern has been changed. The virtual shop online 
is a novel type of shopping platform that the customer can 
make a purchase at anytime, anywhere, and no distance limit 
thought internet network. Hence, the method for 
anthropometric measurement without complex setting and no 
experience required is necessary. 

In traditional anthropometric measurement methods, the 
measurement can be done by using a simple instrument such as 
tape and without complex measurement pre-setting. The 
anthropometric data can be classified into two types: linear 
distance and circumference. The linear part is defined as the 
distance between two anatomical landmarks. The 
circumference part is defined as the length that can close 
around the part at predefined location. Through the use of 
manual measurements by domain experts, combined with 
precise measurement devices, an accurate model of human 
body and its measurement data can be obtained [1]. Although 
the traditional anthropometry measurement is easy and 
convenient to use, the traditional measurement relies on manual 
operations that are inefficient and prone to errors. 

2D image-based anthropometric measurement methods 
adopt two or more photographic to do the anthropometric 
measurement. Generally, the camera must be set and calibrated 
before capturing image. Through image process and geometric 
transformation, the anthropometric measurement data can be 
obtained from captured images. Meunier and Yin [2] proposed 
an anthropometric measurement system that can generate body 
measurements from two-dimension images. Their system 
comprised two synchronized and calibrated cameras that 
captured two images: front and side. Six anthropometric 
measurement data (neck circumference, chest circumference, 
hip circumference, waist circumference, stature, and sleeve 
length) can be measured and compared with manual 
measurement by using tape for efficient evaluation. Hung et al. 
[3] used geometric shapes (ellipse and rectangular) to 
approximate the shape of critical part circumference. The 
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ellipse shape was used to approximate neck, wrist, and palm 
circumferences, and the major and minor axes length were 
obtained from the front and side views. The combination of a 
rectangle and an ellipse was used to approximate the chest 
circumference, and with the calculation of perimeter the chest 
circumference can be obtained. Although, 2D image-based 
methods can be used for anthropometric measurement, the 
complicated pre-setting, camera setting and calibration, and too 
much manual operations make it not easy for people to use. 

There have been significant hardware and software 
development in laser scanning technique during the last two 
decades. Nowadays, the 3-D laser scanning technique makes it 
possible to digitalize the complete surface of human body and 
provides much richer information about the body shape than 
the traditional anthropometric measurements [4]. CAESAR, 
which stands for Civilian American and European Surface 
Anthropometry Resource, is the first large scale 3-D 
anthropometry survey project [5]. This approach provides a 
standard model of digitalized human body shape and opens up 
opportunities to extract new measurements for quantifying the 
body shape. The first attempt in processing 3-D anthropometric 
data for analyzing the body shape is to extract traditional 
anthropometric measurements from the scanned data [6]. 
Although working with the 3-D surface data has the advantage 
of being able to perform repeated measurements without the 
subject being present, the 3-D scanning equipment is not 
available in anytime and anywhere and the collected 3-D 
scanning data still needs complicated analysis to calculate 
human body anthropometric data. Furthermore, the 3-D human 
body scanning data would possibly touch off privacy right 
because of its rich and detail information of human body shape. 

This paper describes an approach to make an 
anthropometric measurement with fewer constraints, no high- 
price measurement instruments and complicated pre-setting. 
The Chinese medicine acupuncture theory, which has a 
distinctive way to locate the acupuncture points on human body, 
is used to fast locate the position of critical parts. The human 
body slice model accessed from the slice shape analysis of 3-D 
scanning data at critical part is applied to approximate the 
circumference shape. The fuzzy inference system is adopted to 
evaluate the influence of garment for anthropometric 
measurement, and the compensation system is applied to 
correct the measured data. The objective of this paper is to 
compare the accuracy and precision of measurements made 
from 2D images of human with those of highly trained 
anthropometric expert [2]. 

This paper is organized as follows. Section II provides a 2D 
image-based anthropometric measurement that used two side 
images, front and side, which captured by camera, and the 
human body slice model is supplied to approximate the critical 
circumference shape. In section III, the compensation system, 
including garment thickness estimation and measurement 
compensation, is proposed. The experimental results and 
discussion are showed in section IV and section V respectively. 
Section VI concludes this paper. 
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Figure 1 . The flowchart of 2-D image-based anthropometric measurement. 



II. 2D IMAGE-BASED ANTHROPOMETRIC MEASUREMENT 



In this section, the method of 2D image based 
anthropometric measurement is provided, and includes system 
description, image preprocessing, Chinese medicine 
acupuncture theory for position locating, and human body slice 
model. At the first, system description is to descript the 
measurement instruments and the requirements of system. 
Secondly, image preprocessing is used to process the images 
captured from camera and acquire the information to later 
analysis. Thirdly, the Chinese medicine acupuncture theory is 
adopted to locate the critical position because of its distinctive 
acupuncture point locating method of human body. After 
locating the critical position, the 2D anthropometric data can be 
obtained by direct measurement. In order to acquire the 
circumference, the human body slice model is used to 
approximate the circumference shape. The flowchart of this 
system illustrated in Fig. 1 . 

A. System description 

The system under review is a PC-based system and 
comprised of one Fuji F40 color digital camera (1280 x 960 
pixels) and a white backdrop. The camera is set on tripod with 
about one meter off the ground, and dual-axis bubble levels on 
tripod help to keep the camera level. The system captures front 
and side images of objects standing with some required 
postures. In the front view, the object stand with their arms 
straight and abducted about 45 degree angles from body, and 
the legs straight and slightly abducted. In the side view, the 
object stand with their arms straight and close to body, and the 
legs straight and slightly abducted. In both views, the object is 
required for all fingers closing together without bending, eyes 
looking forward, no glove, and no shoes. The requirement of 
palm is in order to measure the width of four fingers (forefinger, 
middle finger, ring finger, and little finger), which can be used 
for computing the spatial resolution and locating position and 
will be described in next paragraph and Section II. C. Fig. 2 
shows the required postures of object in two views. 

In this paper, the system is assumed to know the 
anthropometric data (stature, weight, and width of four fingers) 
of individual at the beginning. Through the known 
anthropometric data, the Body Mass Index (BMI) can be 
calculated and applied for rough classification of human body. 
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Figure 2. The postures of front and side view. 

The spatial resolution method, the ratio for direct mapping of 
pixel to actual length, is used to replace the camera calibration 
for simplification. By that way, the actual length of object can 
be calculated if the pixels belong to object length in the image 
is knew. Since the actual width of four fingers is known, the 
spatial resolution of system can be obtained by counting the 
pixels belong to width of four fingers in image. 

B. System description 

The purposes of this part are to extract the human body 
shape from captured image and label the skin regions, where 
without clothes covered. To extract the correct human body 
shape is very important for 2D anthropometric measurement, 
because the error at the extracted human body shape will be 
amplified in final result of anthropometric data. The extraction 
of human body shape is performed by automatic image 
segmentation [7] [8], which uses color edge extraction and 
seeded region growing (SRG) to do image segmentation. The 
purpose of SRG is to merge the part of the same homogeneous 
region. It is also possibility that some regions are actually part 
of the same homogeneous region but have been split because 
the initial given seeds for them are different. Hence, the 
selected region seed should represent the distinguishing 
character of the corresponding region. For our application, the 
centroids of head, limbs, and body trunk are good as the seeds 
for object generation. So we chose six centriods of head, right 
and left hand (palm), body trunk, and right and left foot (sole of 
foot) as seeds for region growing. Except for the centriod of 
body trunk, the regions for seed selected are all belong to skin 
region. For this reason, the skin detection method developed by 
Albiol et al. [8] is used to extract skin color pixels and median 
filter is applied to eliminate the isolated pixel. Through the 
connected component labeling method, the skin region can be 
labeled as head, right and left hand (palm), and right and left 
foot (sole of foot). The centriod of body trunk is defined as the 
mass center of triangle which vertices are the centriods of head, 
right hand, and left hand. Fig. 3(a) shows the edges of captured 
front image by performing the Sobel edge detector and Fig. 3(b) 
shows the obtained skin area. By performing the SRG, the 
human body mask, Fig. 3(c), can be acquired. 

In order to access the spatial resolution described in section 
II.A, the pixel belong to four fingers width must be counted. 
The definition of four-finger width is the width of four closed 
fingers at the position of second forefinger knuckle and Fig. 4 
illustrates the four-finger width. Actually, the correct position 




Figure 3. 



The segmentation of the body from the background, (a) edges (b) 
skin area (c) human body mask 




Figure 4. Illustrate the definition of four-finger width. 

of second knuckle of forefinger is hard to recognize, especially 
in silhouette. Hence, the four-finger width is calculated for the 
average width at the restricted region around second knuckle of 
forefinger. 

There are two steps to extract the width of four fingers in 
2D image. The first step is to rotate the right hand skin region 
to horizontal with the rotation angle, which is defined as the 
angle between the axis of right hand skin region and horizontal 
axis of image. The second step is to make a histogram of 
horizontal right hand skin region, and the average width for 
four- finger width FWavg, can be defined as: 



FW = 

avg 



1 b 

b-a + l^ 



Jf m-n. 2(m-nV 
a - n + round ( ) , b = n + round ( ) 



(1) 



(2) 



where P(i) denotes the histogram value at the position i, n is the 
position of fingertip of middle finger, and m is the position 
where has the maximum histogram value. Take Fig. 5(a) as an 
example, the measured data n and m are 40 and 72, and the data 
a and b can also be calculated as 51 and 61 respectively. 

C. Chinese medicine acupuncture theory for locating 
position 

Acupuncture is originated in China and has a long history. 
Along with the accumulation of clinical experience, 
acupuncture treatment has become a developed theory. 
Acupuncture is the procedure of inserting filiform needles into 
various acupuncture points on the body to relieve pain or for 
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Figure 5. Illustrate the procedure of four-finger width calculation, (a) the 
horizontal right hand skin area and (b) histogram of (a) 

therapeutic purposes. In treatment procedural, the therapist 
should find out the correct acupuncture point position 
corresponding to pain before inserting needle. Hence, the 
acupuncture point location method is developed to locate the 
acupuncture points on human body fast and accurately. The 
most popular used method of acupuncture point location is 
finger-inch [9], which uses a thumb width as the measurement 
unit and denotes the distance between acupuncture point and 
neighbor acupuncture points with measurement unit. In 
generally, the width of four fingers (forefinger, middle finger, 
ring finger, and little finger), has been described and defined in 
past paragraph, can be denoted as 3 units in finger-inch method. 

In this paper, five body measurement data (shoulder length, 
chest circumference, waist circumference, hip circumference, 
and leg length) are selected for discussion because there are the 
most commonly used in manufactory and general application, 
such as clothing size. For locating the corresponding position, 
six acupuncture points (Lian-Quan, Tian-Tu, Chien-Yu, Shan- 
Zhong, Shen-Que, and Qu-Gu) are taken to locate the critical 
positions. The position of Lian-Quan acupuncture point is near 
the center of hyoid bone, so it can be represented as the point of 
chin. The position of Tian-Tu is very close to center of two 
Chien-Yu acupuncture points which are located at point of 
shoulder joints and the distance of these two acupuncture 
points is denoted as the shoulder length. The position of Shan- 
Zhong acupuncture point is at the center of nipples. Therefore, 
Shan-Zhong acupuncture point can be denoted as the position 
of chest. The positions of Shen-Que acupuncture point and 
navel are virtually identical, so it can be denoted as the position 
of waist. The position of Qu-Gu acupuncture point is near the 
upper edge of pubic symphysis, and could be defined as the 
position of hip. Because these five acupuncture points are all 
located at the trunk axis line of human body, the sequence of 
these four acupuncture from top to down (Lian-Quan, Tian-Tu, 
Shan-Zhong, Shen-Que, and Qu-Gu) could be obtained and the 
relative distance are two units, five units, eleven units, and five 
units of finger-inch respectively. The Fig. 6 shows the position 
of acupuncture point on human body. 

D. 2D anthropometric measurement 

The measurement data can be separated into two groups, direct 
measurement (linear) and indirect measurement 
(circumference). In linear part, the measurement is to calculate 
the distance between two acupuncture points: shoulder length is 




Qli-GlJ 



Figure 6 . Illustrate the position of acupuncture points on human body. 

the distance between two Chien-Yu acupuncture points or to 
calculate the distance between the acupuncture point and 
extremity: leg length is the distance between the Qu-Gu 
acupuncture point and sole of foot. Circumference cannot be 
measured directly using front and side view images only and 
must therefore be calculated using some form of mathematical 
model. In this paper, the human body slice model is adopted to 
approximate the circumference shape, and the circumference 
could be obtained through the calculation with two parameters, 
the corresponding width of front and side view image. 

E. 2D anthropometric measurement 

The measurement data can be separated into two groups, 
direct measurement (linear) and indirect measurement 
(circumference). In linear part, the measurement is to calculate 
the distance between two acupuncture points: shoulder length is 
the distance between two Chien-Yu acupuncture points or to 
calculate the distance between the acupuncture point and 
extremity: leg length is the distance between the Qu-Gu 
acupuncture point and sole of foot. Circumference cannot be 
measured directly using front and side view images only and 
must therefore be calculated using some form of mathematical 
model. In this paper, the human body slice model is adopted to 
approximate the circumference shape, and the circumference 
could be obtained through the calculation with two parameters, 
the corresponding width of front and side view image. 

F. Human body slice model 

Circumference cannot be measured directly using only 2D 
measurement, and therefore the mathematical model is applied 
to approximate the corresponding circumference. In this paper, 
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the human body slice model which using Bezier curve [10] is 
used to approximate the circumference. Bezier curve is wildly 
used in computer graphics to model smooth curves, and can be 
applied to approximate the curve by using the respective 
transform on the control points. The quadratic and cubic Bezier 
curve functions are most common used, because the higher 
degree Bezier curve function is more expensive to evaluate. If 
the more complex shape is needed, the lower degree Bezier 
curve can be patched together, which also called piecewise 
polynomial curve. In this paper, the cubic Bezier curve is 
adopted for basic curve to patch the piecewise curve. The cubic 
Bezier curve content with four control points: P , P\, P 2 , and P 3 , 
which existing on the same plane of curve. The cubic Bezier 
curve B(t) starts from P to Pi and arrive at P 3 , and the function 
can be denoted as follow: 

B(t) = (l-t) 3 P +3(l-t) 2 tP 1 +3(l-t)t 2 P 2 +t 3 P 3 , re [0,1]. 
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The 3-D human body scanning technique makes it possible 
to digitalize the complete surfaces of large number of human 
bodies, and provides much richer information about the body 
shape than the traditional and 2D anthropometric measurement. 
If the complete digitalized human body data is available, the 
slice data of human body can be extracted with the plane which 
is concerned. The Taiwan human body bank (TAIBBK) 
[11] [12], which stands for Industrial Technology Research 
Institute, Tshing Hua University, and Chang Gung University, 
is a large scale 3-D anthropometric measurement project. 
About 1100 civilians, between the age of 19 and 65 in Taiwan, 
were scanned. 

Because the build of fat man and thin man or male and 
female is much different, the slice models of human body must 
be set up to different build types respectively. For this reason, 
100 people (50 males and 50 females) were selected in 
TAIBBK database, and the males and females were divided 
into five clusters on the basis of BMI (BMI<18.5, 18.5 <BMI< 
25, 25<BMI<30, 30<BMI<35, and BMI > 35) respectively. 
Then, the slice human model, which contain with shapes of 
chest, waist, and hip, of each build types can be obtained. Fig. 
7 shows the 3-D virtual mannequin and corresponding slice 
shape of standard male build (18.5 < BMI <25). 

In generally, the shape of human body is virtually bilateral 
reflection symmetry with respect to the center mirror plane. 
Furthermore, the shapes of chest, waist, and hip are also 
symmetrically in slice human model. It also means that the 
circumference can be obtained, if the curve length of 
corresponding half-shape is known. Fig. 8(a) shows the whole 
and half chest slice shape of standard male build type, and 
confirms the symmetry of slice shape model. In this paper, the 
piecewise polynomial curve based on cubic Bezier curve is 
used to approximate the curve of half-shape, and then the curve 
length can be calculated by using definite integral. Fig. 8(b) 




Figure 7. Illustrate the 3-D virtual mannequin, chest slice shape, waist slice 
shape, and hip slice shape of standard build of male. 
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Figure 8. (a) The whole and half chest shape of standard male build type, (b) 

The approximation curve and actual curve of chest, (c) The approximation 
curve and actual curve of waist, (d) The approximation curve and actual curve 

of hip. 

shows the curve approximation with two cubic Bezier curves, 
and two sets of control points are marked with blue circle and 
star. The red curve is actual curve of chest slice shape and the 
blue dotted curve is the approximation curve, which is similar 
to manual measurement. 

Then, the expression of piecewise polynomial curve can be 
denoted as follow: 



C{t) = C m +C bz2 =TB c (P l +P 2 ) 



(4) 
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where PI and P2 are the control point sets of anterior curve 
(C bzl ) and posterior curve (C bz2 ) respectively. Now, we take the 
chest circumference measurement of standard male build type 
for example. The expression of approximation curve, showed 
in Fig. 8(b) and followed from Eq.( 4) is showed as: 





C chesl (t) = TB c (P cX +P c2 ) 




(5) 
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width of chest in front image respectively. Similarly, the 
approximate curves of waist and hip of standard man build type 
are show in Fig. 8(c) and Fig. 8(d) respectively, and the 
approximate curve of waist is given by expression 
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where L wl and L w2 are the width of waist in side image and half 
width of waist in front image respectively, and the approximate 
curve of hip is given by expression 



C hip {t) = TB c {P hl + P h2 ) 
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where Lhl and Lh2 are the width of hip in side image and half 
width of hip in front image respectively. 

III. The compensation system for measurement 

In general 2D image-based anthropometric measurement, 
the subject has been required for no -wearing clothes or only 
wearing light underwear. Therefore, the application of 



anthropometric measurement and the place where measurement 
can be used are limited. In order to deal with this issue, an 
efficient compensation system that aims to reduce the influence 
of subject wearing clothes for measurement is presented in this 
paper. This compensation system is composed of two parts; 
estimation of garment thickness and compensation of 
measurement. Because the garment has a great influence on 
measurement of chest circumference and waist circumference 
obviously, the compensations of chest and waist measurement 
are concerned here. The most of pants are close to buttock at 
hip, so the influence of pant on hip measurement is slight and 
can be neglected. The two direct measurement data, shoulder 
length and leg length, also could be neglected because of slight 
influence for distance measurement. 

A. Estimation of garment thickness 

Strictly, the influence of garment on chest and waist 
measurement does not only belong to garment thickness, but 
also to the properties of garment; such as the loose or fitting 
garment. But now we treat them as the same impact factor for 
garment measurement. Fuzzy inference systems have been 
successfully applied in fields of decision analysis and expert 
systems, and the Fuzzy inference is the process of formulating 
the mapping from a given input to an output using fuzzy logic. 
In this paper, the fuzzy inference system is used to infer the 
thickness of garment with given two inputs. The first input 
HC ratio , is defined as the ratio of middle arm width to maximum 
value of possible width and expressed as: 



HC ratio ~ 



1 

HC-0.6FW m 



HC m 



if H01.5FW 



otherwise 



(14) 



where HC is the width of middle arm and HC max is the 
maximum value of possible middle arm width. HC max is given 
as 1.5 times of four-finger width here. The second input Diff ratio 
is defined as the ratio of Diff , which is the difference in width 
between upper end of pant and lower end of garment, to 
maximum value of possible difference. The function of Diff ratio 
is given by 



DW ra 



1 
Diff 



Diff m , 



if Diff >l.&FW a 

otherwise 



(15) 



where Diff max is the maximum value of possible Diff and given 
as 1.8 times of four-finger width. 

Garment thickness (DT degree ) is the output of fuzzy inference 
system and lies in the interval to 1, which represent the 
thinnest and thickest respectively. Follow the Eq. (14), the 
input value HC ratio lies in the interval to 1, which represent 
the few and many respectively. Similarly, the input value 
Diff ratio lies in the interval to 1, which represent the small and 
very large respectively. The membership functions of input and 
output are trapezoid type and triangle type respectively and 
show in Fig. 9. The fuzzy rules of this system are listed in 
TABLE I. Taking one rule for example, if HC ratio is "few" and 
Diffratio is "large", and then the DT degree is "median". In 
defuzzification processes, the stander Mamdani -type method 
with COA-defuzzifier (center of area) is adopted. 
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TABLE I. 



The fuzzy rules 
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B. Compensation of measurement 

The compensation system is a negative compensation and 
modification mechanism. The goal is to retrieval the chest and 
waist measured data in 2D measurement. So the outputs of 
compensation system are the reduction rates of measurement 
data, which are the direct measurement width of chest and 
waist in front and side view. The impact factors, also called 
inputs, of this system are garment thickness DT degree md BMI. 

The neural network is an efficiently method to simulate the 
structure and functional aspects of biological neural network. 
Hence, the back-propagation neural network (BPNN) [13] is 
used to structure compensation system. The tan-sigmoid 
function is applied for transfer function and 105 (63 males and 
42 females) training data are used. 
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Figure 9 . The membership functions of inputs HC rat i and Diff rati0 and 
output DT desree 



IV. Experimental results 

This section describes the experiment results including 
accuracy and precision parts. The accuracy part adopts statistic 
methods to calculate the correlations between proposed system 
and traditional measurement system. The precision analysis is 
designed to repeated measurements by proposed and manual 
method, and aimed to show the repeatability of system. 

A. Accuracy 

The accuracy of the 2D image-based measurement system 
was calculated by comparing 2D image-based measurement 
with the manual measurement, or called traditional method, 
taken by anthropometric [14]. Two experiments, simplex 
garment, short sleeve T-shirt, and manifold garment, are 
performed in accuracy part. The first experiment is to calculate 
the accuracy of measurement system with less influence of 
garment, and the second experiment is to calculate the accuracy 
of measurement system with each kind of garment. 

In the first experiment, the test sample composed of 175 
subjects (97 males and 78 females) that have worn short sleeve 
T-shirt and been measured both with traditional method and 
with 2D image-based method. Each subject was measured once 
by each method. The means and standard deviations for 
subjects measured in these two methods are listed in TABLE II. 
The Pearson correlation coefficients between manual and 2D 
image-based measurements also list in TABLE II. The error 
rate (E) is the absolute value of difference of measurement and 
actual value (manual measurement) against the actual value. It 
is computed using the follow equation: 



measurement 



^100% 



(16) 



In the second experiment, the test sample compose with 3 1 
males and 15 females and be required to wear 4 kinds of 
garment, short T-shirt, long T-shirt, thin jacket, and thick jacket, 
for measurement. In order to assess the accuracies of 2D 
image-based measurement for each type of garment, the 



TABLE II. 



Accuracy results of simplex garment (short sleeve 
T-shirt) (mm) 



Measurement 


Males 


(w=97) 






Female 


(n=7S) 






Mean 


SD 


Corr. 


t^avg 


s Mean 


SD 


Corr. 


kavg 


Shoulder length 


















Manual 


443 


21 






407 


22 






2-D system 


442 


19 


0.90 


2.17% 


404 


21 


0.95 


1.99% 


Chest circ. 


















Manual 


1002 


73 






947 


83 






2-D system 


1006 


70 


0.96 


1.78% 


949 


80 


0.96 


1.87% 


Waist circ. 


















Manual 


999 


79 






935 


71 






2-D system 


1003 


77 


0.97 


1.74% 


937 


70 


0.98 


1.71% 


Hip circ. 


















Manual 


1011 


72 






974 


79 






2-D system 


1013 


69 


0.95 


1.82% 


973 


77 


0.96 


1.83% 


Leg length 


















Manual 


1018 


28 






977 


26 






2-D system 


1017 


26 


0.93 


1.87% 


979 


24 


0.92 


1.88% 
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TABLE III. 



Accuracy results of manifold garment (m m ) 



Measurement 


(«=46) 








Mean 


SD 


Corr. 


E an 


Shoulder length 










Manual 


442 


22 






2-D system I 


440 


20 


0.91 


1.92% 


2-D system II 


439 


21 


0.92 


1.93% 


2-D system III 


440 


21 


0.91 


2.11% 


2-D system IV 


445 


22 


0.90 


2.29% 


Chest circumference 










Manual 


987 


77 






2-D system I 


990 


76 


0.96 


1.77% 


2-D system II 


992 


77 


0.95 


1.81% 


2-D system III 


992 


76 


0.95 


2.37% 


2-D system IV 


995 


81 


0.90 


4.43% 


Waist circumference 










Manual 


989 


80 






2-D system I 


992 


80 


0.98 


1.66% 


2-D system II 


991 


79 


0.96 


1.78% 


2-D system III 


995 


81 


0.97 


3.29% 


2-D system IV 


996 


86 


0.88 


4.58% 


Hip circumference 










Manual 


1003 


76 






2-D system I 


1005 


77 


0.96 


1.88% 


2-D system II 


1004 


77 


0.95 


1.93% 


2-D system III 


1005 


75 


0.92 


1.77% 


2-D system IV 


1004 


75 


0.93 


2.09% 


Leg length 










Manual 


1004 


30 






2-D system I 


1001 


29 


0.96 


1.92% 


2-D system II 


999 


28 


0.95 


1.97% 


2-D system III 


1001 


30 


0.97 


2.89% 


2-D system IV 


1002 


31 


0.96 


2.13% 



(I: short sleeve T-shirt, II: long sleeve T-shirt, III: thin jacket, and IV: thick 
jacket) 

system measurement and once with manual measurement. The 
results of means, standard deviations, average error rate, and 
Pearson correlation coefficients are listed in TABLE III. 

B. Precision 

The precision of the 2D image-based system was 
determined by performing ten repeat measurements on subject 
wore short sleeve T-shirt for manual and 2D image-based 
method. The camera setting and lighting conditions were 
relative constant and the re-measurement must be done within 
minute to ensure the postural of subject is almost the same. The 
means, standard deviations, and difference range of repeated 
measurement is listed in TABLE IV. 

V. DISCUSSION 

This section will discuss the experiment results, which 
contains accuracy, precision, and reliability. In the accuracy 
part, the statistical methods, t-test, f-test, and Pearson 
correlation coefficient, and simple error rate are adopted for 
assessment. The higher correlation coefficient and lower error 
rate indicate the higher accuracy. The precision is defined as 
the difference in values obtained from measuring the same 



object repeatedly. The confidence interval method with 95% 
confidence level is applied to evaluate the precision of repeated 
measurement. In the same confidence level, the smaller 
confidence interval means the higher precision, or repeatability. 
The reliability coefficient is applied to measure the correlation 
of the repeated measurements and two useful parameters: the 
Technical Error of Measurement (TEM) and reliability 
coefficient (R) [15], are used to characterize reliability of 
system here. 



TABLE IV. 



Measurement repeatability results (mm) 



Measurement 


Mean 


Range 


Std. 


1.96 Std. 




(w=10) 




Dev. 


Dev. 


Shoulder length 










Manual 


438 


3 


1.87 


3.67 


2-D system 


433 


5 


1.97 


3.86 


Chest circumference 










Manual 


991 


12 


4.56 


8.93 


2-D system 


1001 


15 


4.77 


9.35 


Waist circumference 










Manual 


924 


9 


2.95 


5.78 


2-D system 


938 


12 


3.46 


6.78 


Hip circumference 










Manual 


983 


8 


2.06 


4.03 


2-D system 


996 


12 


2.51 


4.92 


Leg length 










Manual 


1004 


9 


2.26 


4.43 


2-D system 


1017 


11 


2.45 


4.80 



A. Accuracy 

Two statistical tests: f-test and /-test, and error rate are 
sufficient to characterize the accuracy of an anthropometric 
measurement. The f-test is performed to compare the means of 
all dimensions, and /-test is performed compare the standard 
deviation of all dimensions. The error rate is a simple way to 
quantify the accuracy and it is also easy to interpret the amount 
of difference between two measurements. The Pearson 
correlation coefficient is a measure of the strength of the 
association between the two set of measurement, and could 
also to be used to represent the degree of accuracy. 

In first experiment results, f-tests and /-tests comparing of 
manual and 2D image-based measurements show no significant 
difference, indicating that the results of these two methods are 
similar and equally consistent. The results of Pearson 
correlation coefficient show that the closely perfect positive 
correlations were calculated and exhibited these two 
measurement data have similar. The results of average error 
rate were less than 2.5 %, and showed the perfect accuracy of 
2D image-based measurement. 

The second experiment performed to compare the 2D 
image-based measurement for each type of garment. Although 
the challenge of four type of garment was added in this 
experiment, the f -tests and /-tests comparing of manual and 2D 
image-based measurements show no significant difference 
(p<0.05), indicating that the similarity of results of these two 
methods is more than 95 %. In other words, the accuracy of 2D 
image -based measurement is more than 95%. The results of 
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Pearson correlation coefficient exhibit the highly positive 
correlation of 2D image-based measurement and manual 
measurement. No matter what type of garment was worn, the 
average error rates of each dimension are less than 5 %. It can 
be proved that the compensation system is feasible to reduce 
the influence of garment thickness on measurement. 

Although the accuracy of 2D image-based system had been 
proofed to reach more than 95% from the results of two 
experiments, the error of measurement can not be ignored. The 
error in spatial resolution, which came from the calculation of 
four-finger width, will be spread to each measurement, and the 
effect of indirect measurement (circumference) is more than 
linear measurement in theoretical assessment. Within the same 
factor of error in spatial resolution, calculation of four-finger 
width, the error of critical position is built from the location 
procedure using Chinese medicine acupuncture theory. 

The error of circumference measurement came from the 
imperfect approximation of circumference shape, and the 
integrities of inference system of garment thickness and 
compensation would cause the accuracy. In this paper, the 
human body slices model and inference systems are performed 
to indirect measurement and has good performance on 
accuracy, indicating that the adequate approximation model 
and inference systems are presented. 

B. Precision 

The precision is defined as the difference in values obtained 
from measuring the same object repeatedly. As show in 
TABLE IV, the results of repeated measurement showed the 
error of manual measurement (observed by skilled man) to be 
within around 3 to 9 mm of means, 95% of the time. For the 
most part, the linear measurements are more repeatable and 
precise than circumference, and the hip circumference are more 
repeatable and precise than other circumferences. The results in 
TABLE IV show that the repeated measurements by 2D image- 
based method exhibited the error of precision to be within 
around 3 to 10 mm of means, 95% of the time, and showed the 
same basic trend as for the manual in that the linear 
measurements were more precise than circumferences, and hip 
circumference was more repeatable than other circumferences. 
The results of chest circumference and waist circumference in 
both methods exhibited more variability than others 
measurements, which was anticipated. The reason of this can 
be partly interpreted by torso movement due to the expansion 
and contraction of ribcage and abdomen of breathing and 
differences in posture of subject. 

C. Reliability 

The reliability is a kind of correlation coefficient 
measurement and used to measure the correlation of the 
repeated measurements here. Reliability is quantified in 
anthropometric studies using the Technical Error of 
Measurement (TEM) [15], which is essentially a form of 
standard deviation, and reliability coefficient. The TEM, or 
called r, was calculated using the following equation 




-(X)&) 2 ) 



n(k-l) 
squared value 



(17) 

where x/ is the squared value of the j th replicate 
(j=l,2,...,k ), n is the number of subjects, and k is the number 
of measurements per subject. The reliability coefficient (R) [15] 
is calculated due to the measurement error (r 2 ) against the 
sample variance (s 2 ) and calculated with follow expression 

R = l-(r 2 /s 2 ). (18) 

The TEM and R of 2D image-based measurement in precision 
experiment are shown in TABLE V. 

According to the definition of reliability coefficient, if the 
technical measurement error is much smaller than the sample 
variance, the reliability coefficient of measurement will be high. 
As the show in TABLE V, the reliability coefficients of 2D 
image-based measurement are all above 99 % and proofed that 
the 2D image-based system has highly reliability and 
repeatable. 



TABLE V. 



The TEM and reliability coefficients of 2-D image- 
based MEASUREMENT 



Measurement 


TEM (r) 


R 


Shoulder length 


0.11 


99.73% 


Chest circumference 


0.32 


99.41% 


Waist circumference 


0.37 


99.50% 


Hip circumference 


0.29 


99.29% 


Leg length 


0.28 


99.39% 



VI. Conclusion 

This paper introduces a new approach to anthropometric 
measurement based on 2D image, and the results of experiment 
show that the approach is quite comparable to traditional 
measurement method performing by skilled anthropometrists, 
both in terms of accuracy, precision and repeatability. The 
advantage of using this approach is that the people can easy to 
take an anthropometric measurement by using digital camera, 
which is very popular today, without heavy and complicated 
setting, and the high performance of accuracy and reliability 
compared to manual measurement could be obtained. 
Furthermore, the proposed method can provide the 
measurement for people regardless of where, when, or by 
whom. 

For many applications, such as large scale anthropometric 
investigation of population, it can be done by using the 
proposed system, which has high performance of accuracy and 
low cost. In the future, the new type of apparel merchandising 
would be proposed that the customer can buy the clothes at 
home through internet and less return for unfitting reason by 
using 2D measurement in advance. Furthermore, the 
customized clothe could be made by only using photographs 
and without he or she physical presence. 
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Abstract — In order to improve the fractal image encoding, we 
propose a fast method based on the Haar wavelet transform. This 
proposed method speed up the fractal image encoding by 
reducing the size of the domain pool. This reduction uses the 
Haar wavelet coefficients. The experimental results on the test 
images show that the proposed method reaches a high speedup 
factor without decreasing the image quality. 

Keywords- Fractal image compression, PIFS, Haar wavelet 
transform, SSIM index. 

I. Introduction 

Fractal image compression (FIC) is one of the recent 
methods of image compression firstly presented by Barnsley 
and Jacquin [1-5]. This method is characterized by its high 
compression ratio which is achieved with an acceptable image 
quality [6], a fast decoding and a multi-resolution property. It is 
based on the theory of Iterated Function System (IFS) and on 
the collage theorem. Jacquin [3-5] developed the first algorithm 
of FIC by Local or Partitioned Iterated Function Systems 
(PIFS) which makes use of local self- similarity propriety in 
real images. In FIC, the image is represented through a 
contractive transformation defined by PIFS for which the 
decoded image is approximately its fixed point and close to an 
input image. 

In Jacquin' s algorithm, an input image is partitioned into 
non-overlapping sub-blocks Ri called range blocks, the union 
of which covers the whole image. Each range block R^ is put in 
corresponding transformation with another part of a different 
scale, called domain block, looked for in the image. The 
domain blocks can be obtained by sliding a window of the 
same size around the input image to construct the domain pool. 
The classical encoding method, i.e. full search, is time 
consuming because for every range block the corresponding 
block is looked for among all the domain blocks. Several 
methods are proposed to reduce the time encoding. The most 
common approach is the classification scheme [6-10]. In this 
scheme, the domain and the range blocks are grouped in a 
number of classes according to their common characteristics. 
For each range block, comparison is made only for the domain 
blocks falling into its class. Fisher's classification method [6] 
constructed 72 classes for image blocks according to the 
variance and intensity. In Wang et al. [10], four types of range 



blocks were defined based on the edge of the image. Jacobs et 
al. uses skipping adjacent domain blocks [11] and Monro and 
Dudbridge localizes the domain pool relative to a given range 
block based on the assumption that domain blocks close to this 
range block are well suited to match the given range block [12]. 
Methods based on reduction of the domain pool are also 
developed. Saupe's Lean Domain Pool method discards a 
fraction of domain blocks with the smallest variance [13] and 
in Hassaballah et al., the domain blocks with high entropies are 
removed from the domain pool [14]. Other approaches focused 
on improvements of the FIC by tree structure search methods 
[15, 16], parallel search methods [17, 18] or using two domain 
pools in two steps of FIC [19]. The spatial correlation in both 
the domain pool and the range pool was added to improve the 
FIC as developed by Truong et al. [20]. Tong [21] proposes an 
adaptive search algorithm based on the standard deviation 
(STD). Other approaches based on genetic algorithms are also 
applied to speed up the FIC [22-23]. In these methods, higher 
speedup factor are often associated with some loss of 
reconstructed image quality. In the present work, a new method 
is proposed to reduce the encoding time of FIC using the Haar 
wavelet transform. It speeds up the time encoding by 
discarding the smooth domain blocks from the domain pool. 
The type of these blocks is defined using the Haar wavelet 
transform. A high speedup factor is reached and the image 
quality is still preserved. 

II. The proposed method based on Haar wavelet 

TRANSFORM 

A. The Haar wavelet transform 

The Haar Wavelet Transform (HWT) [24] is one of the 
simplest and basic transformations from the space domain to a 
local frequency domain and it is a very useful tool for signal 
analysis and image processing. The HWT decompose a signal 
into different components in the frequency domain. One- 
dimensional HWT decomposes an input sequence into two 
components (the average component and the detail component) 
by applying a low-pass filter and a high-pass filter. In the HWT 
of 2D image of size NxN, a pair of low-pass and high-pass 
filters is applied separately along the horizontal and vertical 
direction to divide the image into four sub-bands of size 
N/2xN/2 (Fig. 1). After one level of decomposition, the low- 
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low-pass sub-band LL is the multiresolution approximation of 
the original image, and the other three are high frequency sub- 
bands representing horizontal, vertical and diagonals edges, 
respectively. The LL band is again subject to the same 
procedure. 



LL 


HL 


LH 


HH 



Figure 1 . The result of 2D image HWT decomposition. 

This wavelet decomposition can be repeatedly applied on 
the low-low-pass sub-band at a coarser scale unless it only has 
one component as shown in fig. 2. 

Let D be a given image block of size NxN. D can be 
decomposed into one component low-pass signal by a log 
N/log 2 pyramidal HWT. In the case of N=8, D can be 
transformed to one component by 3 decomposition (Fig. 2). 
The LL 3 band in level 3 is the multiresolution approximation of 
LL 2 bands in level 2. The coefficients HL^LfLand HH 3 of the 
highest level denote the coarsest edges along horizontal, 
vertical and diagonal directions respectively in level 2. 



LL 3 


HL 3 


HL 2 


HL 1 


LH 3 


HH 3 


LH 2 


HH 2 


LH 1 


HH 1 



Figure 2. 



The result of three level HWT pyramidal decomposition of an 
image block of size 8x8. 



We will refer to these coefficients obtained at the highest 



level hereafter as WH D for HL 
WD D . 



WV D for LW and HH J for 



If both WH D and WV D are small, then the block D tends to 
have less edge structure (smooth block). When a block has high 
degree of edge structure, either WH D or WV D will be large. If 
WH D is larger, D will have horizontal edge properties. On the 
other hand, if WV D is larger, then D will have vertical edge 
properties. Finally, those blocks with high magnitudes of WH D 
and/or WV D are designed as heterogeneous domain blocks. The 
type of each block D is determined as follows: 



if |wH D |<T w and|wv D |<T w 

then D is a smooth domain block 
else D is a heterogeneous domain block 



(1) 



where I.I denotes the absolute value of its variable and T w is a 
threshold. 



Thus, an image block D can be determined as belonging to 
smooth or heterogeneous type by using its vertical coefficient 
WV D and its horizontal coefficient WH D obtained by a 
pyramidal HWT at the highest level. 

The computation of WH D and WV D do not require the 
calculation of other wavelets coefficients. Indeed, let D be an 
image block of size 4x4 represented as follows: 



1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


15 


16 



Then, analyzing the expression of the coefficients WH D and 
WV D obtained after two pyramidal HWT, allow us to find the 
following simplified formula: 



WHr 



WVi 



D : 



A-B 

16 

C-D 

16 



(2) 



(3) 



where A = l + 2 + 5 + 6 + 3 + 4 + 7 + 8 ,B = 9 + 10 + 13 + 14 + 11 + 12 + 15 + 16, 
C = 1 + 2 + 5 + 6 + 10 + 13 + 14 , D = 3 + 4 + 7 + 8 + 11 + 12 + 15 + 16. 

B. The proposed method 

The proposed method is aimed to reduce the encoding time 
by reducing the cardinal of the domain pool. As only a fraction 
of the domain pool is used in fractal encoding and the set of the 
used blocks is localized along edges and in the regions of high 
contrast of the image (designed as heterogeneous blocks), it's 
possible to reduce the cardinal of the domain pool by 
discarding the smooth domain blocks. Therefore, each range 
bloc is compared only to the heterogeneous domain blocks. 
This method of reduction of the domain pool is simple since 
only few computations are required to calculate the coefficients 
WH D and WV D of a domain block D to classify it as smooth or 
heterogeneous. 

The threshold T w can be fixed or chosen in an adaptive 
way. Determining T w adaptively allow us to choose the 
speedup ratio. The main idea is to set the thresholds such that a 
fraction a of the domain pool can be discarded. The value of a 
can be one third, tow thirds,... of the domain pool. Due to the 
fact that the encoding time depends on the number of 
comparisons between range and domain blocks, the speedup 
ratio can be estimated. 



The determination of the threshold T w , 
the fraction a of the domain pool to 
summarised as follows: 



which depends on 
be eliminated, is 



For each domain block D, calculate the Haar wavelet 
coefficient WH D and WV D . Set S D =max(IWH D l, 

IWV D I). 

Sort all the values of S D in increasing order. 
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• Find S* corresponding to the value of a. Set the 
threshold T w =S*. 

Due to the fact that we apply our method in the case of a 
quadtree partitioning, we choose different thresholds for every 
size of the domain blocks. 

The first steps of the proposed method are as follows: 

• Choose a value of a. 

• Construct the domain pool. 

• Compute the Haar wavelet coefficients IWH D I and 
IWV D I for each domain block D. 

• Determine the threshold T w for each size domain 
block. 

• Remove from the domain pool the smooth domain 
blocks. 

III. Experimental results 

The different tests are performed on three 256x256 images, 
represented in fig. 3, with 8 bpp on PC with Intel Pentium Dual 
2.16 Ghz processor and 2 GO of RAM. The quadtree 
partitioning [6] is adopted for the FIC. The encoding time is 
measured in seconds. The rate of compression is represented by 
the compression ratio (CR), i.e. the size of the original image 
divided by the size of the compressed image. The speedup 
factor (SF) of a particular method can be defined as the ratio of 
the time taken in full search to that of the said method, i.e., 



SF: 



Time taken in full search 
Time taken in a particular method 



(4) 



The image quality is measured by the peak signal to noise 
ratio (PSNR) and the structural similarity Measure (SSIM) 
index [25]. 

The PSNR of two images X and Y of sizes N, is defined as 
follows: 



PSNR =10x log 



'255^ 
MSE 



where 



MSE 



i=l 



(5) 



(6) 



Xi and yi are the gray levels of pixel of the original image 
and the distorted image respectively. 

The SSIM index is a method for measuring the similarity 
between two images x and y defined by Wang [25] as follows: 



SSIM(x,y) = 



(2jLi x! Li y + C i X2cT xy + C 2 ) 



(u. 2 +u. 2 +C 1 )(c 2 +a 2 +C 2 ) 



where ]i x =— I>i, u. y =— lyi, a x =(^~ l Y, (Xi ~^ )2)2 ' 



CI and C2 are positive constants chosen to prevent unstable 

]) or (Ox+Sy) 



measurement when (ji 2 + u.y) or (a 2 +oh is close to zero. They 



are defined in [25] as: 



C { = (KiL) 2 , C2= (K 2 L) 2 



(8) 



where L is the dynamic range of pixel values (L= 255 for 8-bit 
gray scale images). Ki and K 2 are the same as in [20]: Kl= 
0.01 and K2= 0.03. 

In the present work, we use a mean SSIM (MSSIM) index 
to evaluate the overall image quality: 



MSSIM(X,Y) = — JTsSIMCx^Vi) 



(9) 



i=l 



where X and Y are the original and the distorted images 
respectively; x^ and yi are the image contents at the i th local 
window of size 8x8 and M is the number of local windows of 
the image. 




(a) 




(b) 
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TABLE II. The results of the proposed method for Baboun image. 



(c) 
Figure 3. Images of size 256 x 256 : Lena (a), Peppers (b) and baboun (c). 

Table 1 and 2 gives the encoding time, the compression 
ratio, the image qualities and the speedup factor measured on 
the three test images for different values of (1-oc). The full 
search occurs when oc=l and there is no time reduction because 
no domain block is eliminated. The results show that the 
encoding time scales linearly with a as illustrated in fig. 4. For 
values of a between 0.9 and 0.3, there is no degradation in the 
image quality. On contrary, the PSNR improves slightly for the 
test images. The SF of 10 causes a drop of PSNR of 0.6 dB, 
0.55 dB and 0.07 dB for Lena, Peppers and Baboun images 
respectively. 



a 


Baboun 


Time 


CR 


PSNR 


MSSIM 


SF 


1 


25.41 


7.46 


32.55 


0.8802 


1.00 


0.9 


22.29 


7.45 


32.55 


0.8805 


1.14 


0.8 


21.54 


7.45 


32.56 


0.8805 


1.18 


0.7 


17.07 


7.40 


32.57 


0.8813 


1.49 


0.6 


14.71 


7.33 


32.63 


0.8829 


1.73 


0.5 


12.95 


7.33 


32.64 


0.8831 


1.97 


0.4 


10.84 


7.27 


32.65 


0.8828 


2.35 


0.3 


7.74 


7.22 


32.66 


0.8827 


3.29 


0.2 


5.40 


7.11 


32.65 


0.8829 


4.72 


0.1 


2.87 


6.90 


32.57 


0.8806 


8.87 


0.08 


2.31 


6.81 


32.48 


0.8785 


11.03 


0.06 


1.72 


6.73 


32.43 


0.8769 


14.81 


0.04 


1.39 


6.66 


32.24 


0.8741 


18.32 


0.02 


0.84 


6.42 


31.95 


0.8693 


30.32 


0.008 


0.56 


6.29 


31.67 


0.8632 


45.48 


0.006 


0.47 


6.21 


31.22 


0.8533 


54.19 


0.004 


0.45 


6.22 


31.18 


0.8523 


56.60 



TABLE I. 



The results of the proposed method for Lena and 
Peppers images. 



a 


Lena 


Peppers 


Time 


CR 


PSNR 


MSSIM 


SF 


Time 


CR 


PSNR 


MSSIM 


SF 


1 


19.64 


10.46 


30.92 


0.8909 


1.00 


19.64 


10.98 


31.91 


0.8931 


1.00 


0.9 


18.17 


10.41 


30.94 


0.8915 


1.08 


16.71 


10.98 


31.91 


0.8931 


1.18 


0.8 


15.32 


10.35 


30.98 


0.8933 


1.28 


15.18 


10.94 


31.93 


0.8936 


1.29 


0.7 


14.13 


10.30 


30.99 


0.8934 


1.39 


13.85 


10.92 


31.92 


0.8936 


1.42 


0.6 


11.72 


10.19 


31.05 


0.8948 


1.68 


11.40 


10.81 


31.94 


0.8938 


1.72 


0.5 


10.30 


10.11 


31.05 


0.8947 


1.91 


9.86 


10.65 


31.98 


0.8958 


1.99 


0.4 


7.99 


9.90 


31.06 


0.8971 


2.46 


7.71 


10.49 


31.99 


0.8956 


2.55 


0.3 


6.40 


9.66 


31.02 


0.8957 


3.07 


6.05 


10.45 


31.88 


0.8943 


3.25 


0.2 


4.31 


9.47 


30.88 


0.8936 


4.56 


4.09 


10.20 


31.77 


0.8927 


4.80 


0.1 


2.39 


9.09 


30.48 


0.8891 


8.22 


2.29 


9.88 


31.47 


0.8877 


8.58 


0.08 


1.92 


9.03 


30.32 


0.8872 


10.23 


1.94 


9.68 


31.36 


0.8870 


10.12 


0.06 


1.56 


9.02 


30.13 


0.8843 


12.59 


1.51 


9.54 


31.13 


0.8839 


13.01 


0.04 


1.11 


8.81 


29.89 


0.8805 


17.69 


1.14 


9.29 


30.90 


0.8811 


17.23 


0.02 


0.67 


8.73 


29.53 


0.8737 


29.31 


0.70 


9.05 


30.43 


0.8752 


28.06 


0.008 


0.45 


8.63 


29.24 


0.8683 


43.64 


0.47 


8.82 


29.95 


0.8690 


41.79 


0.006 


0.39 


8.50 


29.13 


0.8655 


50.36 


0.41 


8.76 


29.56 


0.8631 


47.90 


0.004 


0.36 


8.33 


29.03 


0.8637 


54.56 


0.36 


8.80 


29.06 


0.8567 


54.56 




0,4 0.6 

Param^e-r «i 

Figure 4. Effect of parameter a on encoding time. 

The CR decreases slightly when SF<0.2 (0.99 for Lena, 
0.78 for Peppers and 0.35 for Baboun). When SF increases, the 
CR decreases. A higher SF is accompanied with a high 
decrease of CR. This could be explained by the fact that some 
large range blocks could be covered well by some domain 
blocks which were excluded from the domain pool. Therefore, 
these large range blocks are subdivided in four quadrants 
resulting in a decrease of CR. For example, when SF«42 the 
drops of CR are 1.83, 2.16 and 1.17 for Lena, Peppers and 
Baboun respectively. For comparison, the full search reaches a 
PSNR of 30.92 dB with a required time of 19.64 seconds for 
Lena image. In the proposed method, the encoding time of 
Lena image is 1.11 seconds while the PSNR is 29.89 dB when 
oc=0.04. The speedup factor attains 17.69 with a drop of PSNR 
of 1.03. 

For visual comparison, fig. 5, fig. 6 and fig. 7 shows 
examples of reconstructed images encoded using full search 
and the proposed method. 



33 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



(IJCSIS) International Journal of Computer Science and Information Security, 
Vol. 8, No. 8, November 2010 





Figure 5. Reconstructed image Lena by full search (a) and by the proposed 
method (b) when SF = 17.69. 



Figure 7. Reconstructed image Baboun by full search (a) and by the 
proposed method (b) when SF = 18.32. 





Fig. 8, fig. 9 and fig. 10 show that PSNR and MSSIM vary 
in the same way according to the encoding time for the test 
images. 





(b) 
Figure 8. Encoding time versus PSNR (a) and MSSEVI (b) for Lena. 



Figure 6. Reconstructed image Peppers by full search (a) and by the 
proposed method (b) when SF = 17.23. 
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(a) 




(b) 
Figure 9. Encoding time versus PSNR (a) and MSSEVI (b) for Peppers. 




Figure 10. Encoding time versus PSNR (a) and MSSEVI (b) for Baboun. 



When comparing our method to Hassaballah et al. method 
(HM), we find that our method preserve well the image quality 
and mark a slight decrease of CR for a high speedup factor than 
HM. For Lena image, a speedup factor of 10.23 is reached with 
a drop of 0.6 dB and a decrease of the CR by 1.43 while HM 
cause a drop of PSNR of 3.69 dB and a drop of CR of 3.28. For 
the same SF, the drop of MSSIM is of 0.024 by our method and 
0.0668 by HM. Furthermore, the results of encoding are still 
better than HM when the SF achieve a high value for the three 
test images. 

The comparison with Saupe's method (SM) shows an 
improvement of the encoding time without drops of PSNR, 
MSSIM and CR. For example, with Lena image, a SF of 29.31 
results in a PSNR of 29.53 dB, a CR of 8.73 and a MSSIM of 
0.8737. While by SM a SF of 21.81 generate a PSNR of 29.30 
dB, a CR of 8.51 and a MSSIM of 0.8654. 



Also the comparison with AP2D-ENT [26] shows that the 
proposed method reaches higher SFs with lesser drops of 
PSNR and of MSSIM than AP2D-ENT. Indeed for Lena 
image, a SF of 43.64 generates a PSNR of 29.24 dB, a CR of 
8.63 and a MSSIM of 0.8683. While the SF of 24.76 obtained 
by AP2D-ENT, generates a PSNR of 28.63 dB, a CR of 8.84 
and MSSIM of 0.8507. Similar improvements are observed for 
Peppers and Baboun images. 

IV. Conclusion 



In this study, we propose to reduce the time of fractal image 
encoding by using a new method based on the Haar wavelet 
transform (HWT). The two HWT horizontal and vertical 
coefficients obtained at the last level of pyramidal 
decomposition are used to determine the smooth or 
heterogeneous type of domain blocks. The proposed method 
reduces the encoding time by removing the smooth domain 
blocks from the domain pool. Experimental results show that 
discarding a fraction of smooth blocks has little effect on the 
image quality while a high speedup factor is reached. 
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Abstract — An improved method for noise estimation of speech 
utterances which are disturbed by additive noise is presented in 
this paper. Here, we introduce degree of noise refinement of 
minima value sequence (MVS) and some additional techniques 
for noise estimation. Initially, noise is estimated from the valleys 
of the spectrum based on the harmonic properties of noisy 
speech, called MVS. However, the valleys of the spectrum are not 
pronounced enough to warrant reliable noise estimates. We, 
therefore, initially use the estimated Degree of Noise (DON) to 
adjust the MVS level. For every English phoneme DON is 
calculated and averaged within those processing frames for the 
each input SNR. We consider this calculated average DONs as 
standard value corresponding to the input SNR which is aligned 
with the true DON using the least-squares (LS) method results a 
function to estimate the degree of noise. Therefore, using the 
technique, it is possible to estimate the state of the added noise 
more accurately. We use two stage refinements of estimated DON 
to update the MVS as well as to estimate a nonlinear weight for 
noise subtraction. The performance of the proposed noise 
estimation is good when it is integrated with the speech 
enhancement technique. 

Keywords-component; Noise Estimation, the Degree of Noise, 
Speech Enhancement, Nonlinear Weighted Noise Subtraction 



I. 



Introduction 



Noise estimation is one of the most important aspects for 
single channel speech enhancement. Usually in single-channel 
speech enhancement systems, most algorithms require a voice 
activity detector (VAD) and the speech/pause detection plays 
the major role in the performance of the whole system. 
However, these systems can perform well for voiced speech 
and high signal-to-noise ratio (SNR), but their performance 
degrades with unvoiced speech in low SNR. 

Traditional noise estimators are based on voice activity 
detectors (VAD) which are difficult to tune and their 
application to low SNR speech results often in clipped speech. 
The original MMSE-STSA estimates the noise power spectrum 
on the basis of the noisy speech only in the first non-speech 
period where the pure noise is available [1]. However, these 
systems can perform well only for voiced speech and high 
SNR. Martin (2001) proposed a method for estimating the 
noise spectrum based on tracking the minimum of the noisy. 
The main drawback of this method is it fails to update the noise 
spectrum when the noise floor increases abruptly [2]. Cohen 
(2002) [3] proposed a minima controlled recursive algorithm 
(MCRA) which updates the noise estimate by tracking the 



noise-only regions of the noisy speech spectrum. In the 
improved MCRA approach (Cohen 2003) [4], a different 
method was used to track the noise-only regions of the 
spectrum based on the estimated speech-presence probability. 
Doblinger [5] updated the noise estimate by continuously 
tracking the minimum of the noisy speech in each frequency 
bin. As such, it is computationally more efficient than the 
method in Martin 2001. However, it fails to differentiate 
between an increase in noise floor and increase in speech 
power. Hirsch and Ehrlicher [6] updated the noise estimate by 
comparing the noisy speech power spectrum to the past noise 
estimate. This method fails to update the noise when the noise 
floor increases abruptly and stays at that level. In our previous 
study, Hamid (2007) [7] proposed the noise estimation by 
using the MVS. The noise floor is updated with the help of 
estimated DON. Here DON is estimated on the basis of pitch 
and the pitch of unvoiced sections is not accurately estimated. 

In this paper, we propose a method which has good noise 
tracking and controlling capability. To estimate noise, first we 
search for the valleys of the amplitude spectrum on a frame by 
frame basis and estimate minima values of the spectrum, called 
minima value sequence (MVS). To improve the estimation 
accuracy of MVS, we use DON. As it is a single-channel 
method, direct estimation of the degree of noise is not possible. 
For that, frame wise averaged DON is estimated from the 
estimated noise of the observed signal. We have considered 
these DONs as standard value corresponding to the input SNR. 
Then each of these estimated 1 st averaged DONs for 
corresponding input SNR is aligned with the true DON using 
the least-squares (LS) method results the 1 st estimated degree 
of noise (DON1) of that frame. The 1 st estimated DON1 is 
applied to update the MVS. Next, the noise level is re- 
estimated and from the estimated noise, we again estimate 2 1 
averaged DON and similarly get the 2 nd estimated DON2. We 
used the 2 nd estimated DON2 to estimate the weight for noise 
subtraction process. Because noise is estimated from the 
estimated DONs, which is obtained from the true DON, so it is 
possible to estimate noise amplitudes in more accurate form 
with lower speech distortion and able to suppress musical noise 
in the enhanced speech. 



nd 



II. 



Proposed noise estimation method 



We have assumed that speech and noise are uncorrelated to 
each other. Let y(n)=s(n)+d(n), where y(n) is the observed 
noisy speech signal, s(n) is the clean speech signal and d(n) 
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is the additive noise. We further assume that signal and 
noise are statistically independent. Under the above 
assumptions, we can write the powers as P y =P s +P d . 

A Estimation of the minima value sequence (MVS) 

The sections of consecutive samples are used as a single 
frame / (320 samples). Consecutive frames are spaced /' (100 
samples) achieving an almost 62.75% overlap between them. 
The short-term representation of a signal y(n) is obtained by 
windowing (Hamming window) and analyzed using iV=512 
point discrete-Fourier transform (DFT) in sampling frequency 
16KHz. Initially, noise spectrum is estimated from the valleys 
of the amplitude spectrum and we assume that the peaks 
correspond to voice parts and valleys are the noise only parts. 
The algorithm for noise estimation is as follows: 

1. Compute the RMS value Y rms of the amplitude 
spectrum Y(k). We detect the minima 
Y mia (k m J^mm(Y(k)) values of Y(k) when the 
following condition (Y(k)<Y(k-l) and Y(k)<Y(k+l) and 
Y(k)<Y ms ) is satisfied. The k min expresses the positions 
of the frequency bin index of minima values. 

2. Interpolate between adjoining minima positions 

(^mm <r ~ K ) to obtain the minimum value sequences 

(MVS) Y min (k). 

3. We smooth the sequences by taking partial average 
called smoothed minimum value sequences (SMVS). 
This process continuously updates the estimation of 
noise among every analysis frames. 

An estimation of noise from the SMVS is survived by an 
overestimation and underestimation of the SNR. To achieve 
good tracking capability with controlled overestimation 
problem, the proposed noise estimation algorithm adopting the 
concept of DON. The block diagram of the noise estimation 
process is given in Figure 1. 

B. Estimation of the Degree of Noise (DON) 

In a single-channel method, we only know the power of the 
observed signal. Therefore, direct estimation of the degree of 



noise ( 



PJP„ h 



) is not possible. For that, frame wise DON is 



estimated from the estimated noise of the observed signal of 
each frame m. For optimal estimation of DON, we are carried 
out our experiment on 20 vowel phonemes of 3 male and 3 
female taken from TIMIT database. First white noise of various 
SNR are added to these voiced vowel phonemes. Then for each 
SNR white noisy phonemes are processed frame wise and 
DON is estimated in each frame for each phoneme 
individually. For every phoneme DON is averaged within those 
processing frames for the corresponding input SNR. Then each 
of these estimated 1 st averaged DONs of each frame m for 

corresponding input SNR expressed as Zlm . This Zlm is aligned 
with the true DON (Z fr ) using the least-squares (LS) method 
results the 1 st estimated degree of noise (DON1) Z lm of that 
frame. The true DON (Z^) is given by 



1 



R+P. 



dB 

-10 10 
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Figure 1. Block diagram of the 2nd estimated DON1, 



1 st averaged estimated DON 
_ _ 1 « P n (m) 



MtiP obs (m) 



(2) 



where, M are the noise added frames; P^(m)and P bs( m ) are 
the powers of noise and observed signals, respectively. Here it 
obvious that we consider only the voiced phonemes in our 
experiment. So the averaged DON value should be limited to 
voiced portion of a speech sentence. But practically the 
unvoiced portion contaminated with higher degree of noise. 
Hence the estimated noise is higher for unvoiced frame than 
from voiced frame. Consequently higher DON value is 
obtained from unvoiced frame than from voiced frame that is 
logically resemblance. The degree of noise estimated from a 
previously prepared function using least square method is given 
by [7] 



QxZ lm + b 



(3) 



(1) 



where Z lm is the 1 st estimated DON1 of frame m. The error 
between the true and the estimated values can be minimized by 
tuning a, b. In the experiment, 20 phoneme sounds for 3 male 
and 3 female degraded by the white noise in different SNRs 
(-10,-5,0,.... ,30 dB) is considered. The value of Z Xm is applied 
to update the MVS. Next, the noise level is re-estimated with 
the help of Z\ m . Finally, from the estimated noise, we again 

estimate 2 nd averaged DON ( Zlm ) and similarly the 2 nd 
estimated DON2 (Z 2m ) which is used to estimate the noise 
weight for nonlinear weighted noise subtraction. 

We conduct an experiment on the noisy speech (white 
noise) utterance /water/ of a female speaker of various input 
SNRs and obtain the 1 st estimated DON1, Z lm and 2 nd estimated 
DON2, Z 2m . Figure 2 illustrates the frame wise true degree of 
noise calculated and the estimated degree of noise obtained in 
every analysis frame for different input SNRs. By adopting 
smoothing in the MVS, the overestimation problem is 
minimized and the effect of musical noise is reduced. In fact 
smoothing is performed to reduce the high frequency 
fluctuations. Since for speech most of the signal energy is 
concentrated in low frequencies, for that reason smoothing is 
reducing the high frequency components and gives increased 
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signal-to-noise ratio. The Fig. 3 shows, the true and the 
estimated degree of noise are almost equal in all SNRs. 
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introduce a nonlinear weighting factor to control the 
overestimation and minimizing the effect of residual noise. The 
NWNS is given by: 
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Figure 2a. True vs 1 st avg. DON (T) Figure 2b. True vs 2 nd avg. DON (T) 
and True vs 1 st estimated DON1 (B). and True vs 2 nd estimated DON2 (B). 
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Figure 3. Frame wise graphical representations of the true (solid with point) and 
the 1 st estimated DON1 (dotted line with circle) and 2 nd estimated DON2 (solid 
line with double linewidth) for -5dB (left) and , 5dB (right) SNR noisy speech. 



C. Estimation of Noise Spectrum 

The noise spectrum is estimated from the SMVS and 1 st 
estimated DON according to the condition 

^ n (k) + (^xY r J 



D m (k)-- 



(4) 



Then we made some updates of D m (k), the updated spectrum is 
again smoothed by three point moving average, and lastly the 
main maximum of the spectrum is identified and are 
suppressed [7]. 

III. Weighted noise subtraction (NWNS) 

Noise reduction based on implementation of the traditional 
spectral subtraction (SS) require an available estimation of the 
embedded noise, here, in time domain we named Noise 
Subtraction (NS). It is observed that, in NS, degradation occurs 
for overestimation of noise within the unvoiced region of noisy 
speech at higher input SNR (>10 dB). We manually seen that 
the unvoiced region provides flat spectrum characteristics and 
exhibits low SNR that gives more degree of noise value that 
increases the noise level. Therefore, the extracted noise in 
unvoiced region is high and degrades the speech. From Figure 
4, it is seen that the unvoiced frame of higher SNR (>10 dB) 
input noisy speech provides flat spectrum and low SNR that 
gives more DON2 (Z 2m ) value that increases weighting factor. 
So more noise has subtracted at every unvoiced frame than 
from every voiced frame, say at 25 dB SNR input speech. 
Consequently speech distortion has to be occurred. For that, we 



S[(n) =y(n)-^axZ tr x d ss (n) 



(5) 



-14.109xZ 2 2 m +9.8273xZ 2 3 m 



is 



where « = 0-3019+ 6.4021xZ 2m 
nonlinear weighting factor. 

It is observed from Eq. (5) that it needs the input SNR. The 
input SNR can be estimated using variance is given by 



SNR, 



101og 10 



(6) 



where, s and * are the variances of speech and noise, 
respectively. We assume that due to the independency of noise 
and speech, the variance of the noisy speech is equal to the sum 
of the speech variance and noise variance. It is found that by 
adopting nonlinear weighted in NS, a good noise reduction is 
obtained. Although with the NWNS, we find the good 
performance with less musical noise by informal listening test. 



Cb) SNR=5dB Flatness=60 
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Figure 4. The depictions spectrums of voiced and unvoiced frames degraded by 
white noise at 5dB SNR is shown in (a) and (b), lOdB SNR is shown (c) and 
(d), 20dB SNR is shown in (e) and (f), 30dB SNR is shown in (g) and (h) 
respectively. 

A Derievation ofnon linear weight 

It is observed that the outcome of the subtraction type 
algorithms produce musical noise and that cannot be avoided. 
Since algorithms with fixed subtraction parameters are unable 
to adapt well to the varying noise levels and characteristics, 
therefore it becomes imperative to estimate a suitable factor to 
update the noise level. Hence we derive a nonlinear weighting 
factor a for this purpose. First, simulation is performed over 7 
males and 7 females speakers of different sentences at 
different SNR levels, randomly selected from the TIMIT 
database, for different values of a and record the output SNR. 
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Table 1 shows the performance of computer simulation of the 
algorithm of a given noisy sentence of a female speaker for 
different values of a. 

TABLE 1: The output SNR for a noisy speech of a female speaker for 
different values of a. for wide range of input SNR (-10dB to 30dB). The 
speech is degraded by white noise nose. 



(IJCSIS) International Journal of Computer Science and Information Security, 

Vol. 8, No. 8, November 2010 
9 



Input SNR— ►■ 


-lOdB 


-5dB 


OdB 


5dE 


lOdB 


15dB 


20dB 


25dB 


30dB 


Values of oc 


0.01 


-9.9456 


-4.9444 


0.05659 


5.0557 


10.05 


15.045 


20.043 


25.041 


30.033 


0.02 


-9.391 


-4.3336 


0.11347 


5.1117 


10.1 


15.09 


20.035 


25.079 


30.069 


0.03 


-9.336 


-4.3324 


0.17066 


5.163 


10.151 


15.134 


20.126 


25.116 


30.094 


0.04 


-9.7308 


-4.7759 


0.22315 


5.2244 


10.201 


15.179 


20.167 


25.15 


30.113 


0.05 


-9.7252 


-4.7191 


0.23594 


5.2312 


10.252 


15.223 


20.207 


25.183 


30.126 


(Kit 


-9.6691 


-4.662 


0.34404 


5.3382 


10.302 


15.268 


20.247 


25.213 


30132 


0.07 


-9.6131 


-4.6045 


0.40245 


5.3954 


10.353 


15.312 


20.236 


25.242 


30.131 


0.03 


-9.5566 


-4.5463 


0.46117 


5.4529 


10.404 


15.356 


20.324 


25.268 


30.124 


0.09 


-9.4993 


-4.4337 


0.52021 


5.5106 


10.455 


15.4 


20.361 


25.291 


30.11 


0.1 


-9.4426 


-4.4303 


0.57955 


5.5636 


10.506 


15.444 


20.393 


25.313 


30.09 


02 


-3.8528 


-3.328 


1.1907 


6.1623 


11.021 


15.373 


20.723 


25.395 


29.563 


0.3 


-8.2277 


-3.1904 


1.335 


6.73 


11.541 


16.277 


20.955 


25.233 


28.615 


0.4 


-7.5644 


-2.5153 


2.5132 


7.419 


12.059 


16.643 


21.077 


24.353 


27.465 


e.5 


-6.8602 


-1.8012 


3.2245 


3.0736 


12.564 


16.96 


21X79 


24.307 


26.23 


0.6 


-6.1134 


-1.0431 


3.9654 


3.7342 


13.044 


17.211 


20.961 


23.652 


25.137 


0.7 


-5.324 


-0.25903 


4.7275 


9.335 


13.432 


17.384 


20.733 


22.941 


24.069 


o 


-4.4953 


0.55759 


5.4943 


10.002 


13.353 


17.47 


20.411 


22.209 


23.032 


0.9 


-3.6392 


1.334 


6.2397 


10.552 


14.151 


17.462 


20.015 


21.431 


22.172 


1 


-2.7765 


2.1379 


6.919 


10.995 


14.34 


17.362 


19.565 


20.772 


21.333 


1.1 


-1.9436 


2.9164 


7.4733 


11.286 


14412 


17.175 


19.031 


20.039 


20.553 


1.2 


-1.2213 


3.4954 


7.8352 


11.393 


14.359 


16.912 


13.576 


19.435 


19.339 


1.3 


■0.63597 


3.3413 


7. 9487 


11.3 


14.133 


16.537 


13.063 


13.313 


19.17 


1.4 


43312 


3 891 


7.7942 


11.021 


13.91 


16.213 


17.549 


13.221 


13.544 


1.5 


■0.51612 


3.6343 


7.3982 


10.588 


13.545 


15.305 


17.042 


17.653 


17.953 


1.6 


■0.91692 


3.1205 


6.32 


10.044 


13.116 


15.373 


16.545 


17.123 


17.406 



TABLE 2: The average weight of a for 7 male and 7 female utterances 
corresponding to wide range of input SNR (-10dB to 30dB). 



Input SNR-i 


-10<1B 


-5<1B 


OdB 


5(1B 


10<1B 


15JB 


:o<ib 


25(1B 


30<1B 


Speaker 


fbjfo 


1.4 


1.4 


1.3 


1.2 


1.1 


0.8 


0.5 


0.2 


0.06 


magvo 


1.4 


1.3 


1.1 


0.9 


0.6 


0.3 


0.1 


0.03 


0.01 


fdrnlo 


1.4 


1.4 


1.3 


1.2 


1.1 


0.8 


0.4 


0.2 


0.05 


rnakro 


1.4 


1.3 


1.2 


1.1 


0.7 


0.4 


0.1 


0.05 


0.02 


fltmo 


1.4 


1.4 


1.3 


1.1 


0.9 


0.6 


0.3 


0.1 


0.03 


mtiso 


1.4 


1.4 


1.3 


1.2 


0.9 


0.5 


0.2 


0.08 


0.03 


fmjfb 


1.4 


1.4 


1.3 


1.3 


1.3 


1.1 


0.7 


0.4 


0.1 


fntbo 


1.4 


1.3 


1.3 


1.1 


0.9 


0.6 


0.2 


0.09 


0.03 


rncalo 


1.4 


1.3 


1.2 


0.9 


0.6 


0.3 


0.1 


0.04 


0.01 


mdmto 


1.4 


1.3 


1.2 


1.0 


0.6 


0.3 


0.1 


0.04 


0.01 


fvrnho 


1.4 


1.4 


1.3 


1.3 


1.1 


0.9 


0.5 


0.2 


0.08 


mdpko 


1.4 


1.3 


1.2 


1.0 


0.6 


0.3 


0.1 


0.04 


0.01 


fj^azo 


1.4 


1.4 


1.3 


1.2 


1.0 


0.7 


0.3 


0.1 


0.04 


mklwo 


1.4 


1.3 


1.2 


1.0 


0.7 


0.3 


0.1 


0.05 


0.01 


Average 


1.4 


1.35 


1.25 


1.1071 


0.36429 


0.56429 


0.26429 


0.11571 


0.035 



Let the set of data points (x,- , yi), i= 1,2,. . .,9 and the curve 
given by Y = f(x) be fitted for this data.. At x = x h the 
experimental value of the ordinate is y, and the corresponding 
value on the fitting curve is f(x,). If e, is the error of 

approximation at x = x h then e /~^z~/l x z) ? men the 
summation of the square of the errors is given by 



se = X« 



z=l 



We consider a is a polynomial of degree 3 . 
Then the 3 rd degree polynomials are: 

a = f(x) = a +a l x + a 2 x 2 +a 3 x 3 } 

be fitted to the data points (x z - , y,), z = 1,2,. .. .,9. 

x represents the values of DON2. 

The summation of errors at x = x, is given by 



S£ = Z t.-k+a, 



a 3 Xi 3 )] 2 



For SE to be minimum, we have 

3a n t? 



o+a^. +a 2 x ; . +a 3 x;j 



~T^- = " 2 S bi ~ ( a + Q l*i + <*2*t + Q 3*i )] * f = 



da v 

d(SE) 

da. 



9 L 

- 2 Sbi -( a o + a i*i +a 2 x 2 +a 3 x 3 )] x? =0 



^ = -2±[ yi -{a 0+ai x i+ a 2 xf + a 3 xf)]^=0 
oa^ /=1 



From Eq. (9), (10), (1 1) and (12) we have, 

9 999 

2>. =9a +a 1 2> i +a2Z x . 2+a 3Z x . 3 

z=l z=l z=l z=l 

9 9 9 9 9 

z=l z=l r=l z=l i=l 

9 9 9 9 9 

ZXVi = a oZ x ' 2 +a iZ x i 3 +a 2 Z x i 4 +a 3 Z x i 5 
[=1 [=1 [=1 1=1 [=1 

9 9999 

E x . 3 >'.- = a o S x . 3 + a i S x . 4 + a 2 E x . 5 + a 3 E x . 6 

1=1 1=1 Z=l 1=1 z=l 

We write these equations in a matrix form as: 



9 

z'-l 




9 

S X ^z 

z'-l 




z'-l 




I>, 3 y, 




L i=i J 





9 2> Ix, 2 X x ? 

z'=l z'=l z'=l 

9 9 9 9 

S*. 2X E x , 3 E x , 4 

z'=l z'=l z'=l z'=l 

9 9 9 9 

^ 2>, 3 E x , 4 E x - 5 

z'=l z'=l z'=l z'=l 

9 9 9 9 

Z^, 3 IX E x f I x - 6 



(7) 



(8) 



(9) 
(10) 



(11) 
(12) 



(13) 
(14) 
(15) 
(16) 



(17) 
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Eq.(17) is a Vander monde matrix. We can also obtain the 
matrix for a least squares fit by writing: 



X 2 
X 3 

x 4 

X 5 

X 6 
X l 
*8 



*r 






4 




a, 


A 






x l 




"i 


x l 
K 




a 2 


4 
4 




a 3 


4 







(18) 



In matrix notation, Eq.(18) can be written as: 

Y=XA 

where, 



(19) 



y 2 
y 3 
y 4 
y 5 
y 6 
y 7 
y s 



x = 



X 2 


X 2 


X 


X 3 


4 


X~ 


X 4 


4 


K 


X 5 


4 


x : 


X 6 


X l 


x < 


X 7 


4 


X 


X * 


X l 


x* 


Xq 


4 


X t 



and A-- 



Multiply both sides of Eq.(19) by X (transpose of X) 



X T Y = X T XA 



(20) 



This matrix equation can be solved numerically, or can be 
inverted directly it is well formed, to yield the solution vector 



(21) 



A=(x T xYx T Y 
In our experiment, 

x- = DON2 i = [0.80707,0.75235,0.59967, 0.32374,0.095033,0.01379 ,-0.009902,-0.01741,-0.019616] 

y . =a t =[l.4, 1.35, 1.25, 1.1071, 0.86429, 0.56429, 0.26429, 0.11571, 0.035] 

So, we have 



~ a l~ 


a 2 


a 3 


a A 


a 5 


a 6 


a 7 


a, 


a 9 



DON2 } DON2] DON2\ 

DON2 2 DON2\ DON2\ 

DON2 3 DON 2] DON2] 

DON2 4 DON2\ DON 2] 

DON2 5 DON2] DON2] and Y - 

DON2 6 DON2 2 6 DON2\ 

DON 2, DON 2] DON 2] 

DOiV2 8 DON2\ DON2\ 

DON2 Q DONll DON21 



Finally we put the values of DON2i,....DON2 9 to get X and 

put the value of a h ,a 9 to get Y. Therefore, from Eq.(21), 

we have 

"0.3019 
6.4021 
-14.109 
9.8273 

Now substitute the value of ° ' l ' 2 3 in Eq.(7) 



A = 



a = 0.3019 + 6.4021x DON! - 



(22) 



Equation (22) is the derivation of the nonlinear weighting 
factor a and is used in Eq. 5. 

IV. Experimental results and discussion 

The proposed noise estimation method is compared with 
the conventional noise estimation algorithm using MVS in 
terms of noise estimation accuracy and quality. Figures 5 
illustrate results of noise estimation in frequency domain (FD) 
measure. In the experiment, we consider the vowel phoneme 
sound /oy/, degraded by the white noise at OdB SNR. It shows 
that, by adopting the proposed DON1 (Z im ), it is possible to 
estimate the state of the added noise more precisely. We 
achieve sufficient improvements in noise amplitudes using the 
MVS+DON1 estimator. Objective measure is also performed 
to verify the quality of the estimated noise. For that we use the 
PESQ MOS measure. Figure 6 shows the PESQ MOS value 
between the added and the estimated noise at different noise 
levels. It shows that PESQ MOS value gradually decreases at 
the higher SNR. 

To study the speech enhancement performance, an 
experiment is carried out by taking 56320 samples of the clean 
speech /she had your dark suit in greasy wash water all year/ 
from TIMIT database. The speech signal is corrupted by white, 
pink and HF channel noises at various SNR levels are taken 
from NOISEX database. The results of the average output SNR 
obtained from for white noise, pink noise and HF channel noise 
at various SNR levels are given in Table 1 for NS and NWNS, 
respectively. 
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Figure 5. Noise spectrums (original and estimated). 
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amplitudes by the estimated D0N1. It eliminates the need for a 
VAD by exploiting the short time characteristics of speech 
signals. In the result part, it is shown that the state of the added 
noise is more accurate with MVS+D0N1. The enhanced 
speech using time domain nonlinear weighted noise subtraction 
results in sufficient noise reduction. The main advantage of the 
algorithm is the effective removal of the noise components for 
a wide range of SNRs. We not only have better SNR but also a 
better speech quality with significantly reduced residual noise. 
However, a little noisy effect still remains. This issue will be 
addressed in our future study. 
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We observe from the Tables 3 that the overall output SNR 
by NS is improved up to 10 dB input SNR and degraded from 
15 dB and higher. Degradation occurs for overestimation of 
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DON2 value that increases the noise level. Consequently the 
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degrade the speech. Hence it is essential to add a weighting 
factor to control the overestimation and we have a better 
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the enhanced speech is distorted in low voiced parts due to 
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TABLE 3: The results of average output SNR for various types of noise at 
different input SNR by the NS and NWNS methods. 



Input 
SNR 


White noise 


HF channel noise 


Pink noise 


NS 


NWNS 


NS 


NWNS 


NS 


NWNS 


-lOdB 


-2.8 


-1.57 


■7.4 


-7.5 


■7.1 


■7.1 


-5dB 


2.0 


2.4 


■2.3 


-2.7 


■2.2 


-2.3 


OdB 


6.5 


5.3 


2.6 


1.9 


2.6 


2.2 


5dB 
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8.7 


7.3 
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7.3 
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lOdB 


13.3 
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11.5 
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10.8 


15dB 
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15.8 


14.5 
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15.4 


20dB 


16.7 
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16.3 


20.3 


25dB 


17.5 


25.2 


17.3 


25.1 


17.3 


25.2 


30dB 


17.7 


30.1 


17.7 


30.1 


17.6 


30.1 



Conclusions 

In this paper, an improved noise estimation technique is 
discussed. Initially noise is estimated from the valleys of the 
amplitude spectrum. Then we have adjusted the estimated noise 
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Abstract — Resource limitation and variety of network and users 
cause many obstacles while transmitting data especially online 
video data through network. Video applications in Internet face 
by significant growth in several market segments and bandwidth 
limitation is one of those challenges which consider as a main 
obstacle in this paper. 



bandwidth requirement of video lead to have a proper resource 
management to reach the real time performance. In section II 
five existing CODEC standards are defined. Comparison points 
and their performance through transmitting online video 
streams are noted in a table at section III. Finally, the 
conclusion has been presented. 



Keywords-component; bandwidth limitation, video codec, video 
conferencing, SVC 



II. 



CODEC STANDARDS 



I. 



Introduction 



During the last few years the Internet has grown 
tremendously and has penetrated all aspects of everyday life. 
Therefore, people are willing to communicate and exchange the 
information due to update their knowledge in different aspects. 
In this case, live connection among people in different location 
around the world comes to demand. Interactive video services 
like video conferencing based on online education, distance 
learning, online video games over the Internet are mostly 
popular now a days. Meanwhile, modern video transmission 
works mainly based on Real Time Transport Protocol (RTP/IP) 
for real time services [1]. RTP mainly is an Internet protocol to 
transmit real time multimedia data over either unicast or 
multicast network services. 

Typically, video data need to be compressed and decoded 
by video CODECs in the real time applications. CODECs are 
one of the common solutions to adapt video streams with low 
bandwidth over the network. Bandwidth limitation is a main 
obstacle faces by video streams through network especially in 
heterogeneous networks. [4], [8], [10], [12], [9]. In fact, video 
CODECs are computer programs for compressing video due to 
reduce bandwidth requirements. After that, they transmit those 
compressed streams and decrease the storage requirements to 
archive them easily. Solving the existed mismatches between 
bandwidth and computational requirement help to identify the 
minimum channel bandwidth required to pass encoded stream 
and minimize the specification of decoding device. In general, 
the video is encoded with CODEC once as a stream. Later, the 
resulting stream would produce a full resolution video in 
decode step [4] . 

This paper focuses on the bandwidth limitation of internet 
through transmitting online video data as the main concern. Not 
only bandwidth limitation on current Internet but also, extreme 



A codec is whether a program or a device to encode or 
decode a digital data stream or a signal. Normally, a codec use 
to encode the streams for transmission, storage or encryption. 
On the other hand, a stream may be decoded for playback or 
editing. The major video coding standards have been created 
since 1990s. They are mostly formulated based on the same 
generic design of a video CODEC that combines a motion 
estimation and compensation front end, a transform stage and 
an entropy encoder. Generally, each standard explains two 
terms such as: 

• Compressed form visual data known as coded syntax. 

• The method of decoding the syntax to rearrange the 
visual information. 

The main target of each standard is to make sure both 
encoders and decoders are compatible to work properly with 
each other. Otherwise, suppliers are not able to develop proper 
products [2]. 

There are different types of CODECs in the market since 
1990. They got huge changes facing by new production 
regarding to their new requirements each time. In this paper 
MPEG-2, H.263, MPEG-4 Visual, H.264 and SVC are 
discussed. 

A. MPEG-2 

MPEG-2 mostly is used for broadcasting digital TV via 
cable, DVD-Video and MPEG Layer 3 audio coding known as 
MP3 which is became popular in terms of storage and sharing 
music. MPEG-2 introduced the idea of Profiles and Levels for 
the first time without restricting the flexibility of the standard. 
In addition, MPEG-2 applications were looking for the proper 
standard for the next generation of products [2]. 
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B. H.263 



The original H.263 standard has published as a standard in 
1995. Its powerful compression was supporting basic video 
quality at bitrates of below 30 kbit/s. H.263 is compatible with 
the standards over wide range of circuit and packet switched 
networks. H.263 contains four optional coding modes and 
added some extra modes to support improved compression 
efficiency and robust transmission over lossy networks [2]. 

C. MPEG-4 Visual 

MPEG-4 is Part2 of the MPEG-4 group of standards. It is 
developed by Moving Picture Experts Group known as 
(MPEG). It works on coding the audio-visual objects and can 
support those applications which are noted as below: 

• Legacy video applications like TV broadcasting, video 
conferencing and video storage. 

• Object-based video applications that a video scene 
contains a combination of different distinct video 
which are coded independently. 

• Computer graphics which are using 2D and 3D 
deformable mesh geometry or human faces and bodies 
that are animated. 

• Hybrid video applications which are combining natural 
video, images and graphics generated by computer. 

• Streaming video over the Internet and mobile channels. 

• High-quality video editing using for the studio 
production. 

In overall, MPEG-4 Visual is slightly simple video coding 
mechanism with block-based video CODEC using motion 
compensation followed by DCT, quantization and entropy 
coding [2]. 

D. H.264 codec 

H.264 is known as Advanced Video Coding which is a 
standard for codec visual data. It is actually designed to support 
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a strong and efficient coding and transmit of rectangular video 

frames. H.264 was created to improve the functionality of 

previous CODEC such as H.263 and MPEG-4. 



H.264 mostly uses to compress video in both commercial 
and military application. H.264 works based on MPEG-2, using 
macroblock-based motion prediction and allows more flexible 
encoding rather than MPEG-2. Furthermore, H.264 explains 
just bit stream syntax. Considering both error resilience and 
coding efficiency, different H.264 encoders may create 
different output [10]. The main applications contain: 



• Two-way video communication 
conferencing or video telephony. 



such as video 



• Coding for broadcast and high quality video. 

• Video streaming over packet networks [2]. 

E. Scalable Video Codec 

The Scalable Video Coding is an extension of H.264/ A VC 
which is created to control bandwidth and loss resilient video 
streaming. SVC is working as a multilayer predictive encoder. 
Therefore, users are able to adapt the received videos by 
extracting and decoding the code layers based on the capability 
of their own devices and network throughput [1], [11]. 

In Scalable Video Coding, decoder is able to decode just 
part of the decoded bitstream selectively. Apart from that, 
encoder arranges the coded stream based on layers including a 
base layer and one or more enhancement layers. For instance in 
(figure 1), decoder A can decode a basic quality version of 
video scene which is received from only the base layer. On the 
other hand, decoder B is receiving all layers and decodes a high 
quality version of the video. Furthermore, a number of 
applications come to demand for example, a low complexity 
decoder may able to decode the base layer; a low bitstream 
with restricted capacity may be extracted for transmission over 
a network segment and also an error sensitive base layer may 
be transmitted with higher priority in comparison with 
enhancement layers. 



>Base Layer- 



Video 
Sequence 



> Decoder A 



Basic-quality 
sequence 



~> encoder 



-> Enhancement- 
Layer 1 



Enhancement 
Layer N 



Decoder B 



high-quality 
sequence 



Figure 1 . Scalable coding: general concept [2] 



The Scalable Video CODEC influences the tools and concepts 
of original CODEC. In addition, SVC supports a number of 
scalable coding modes such as Spatial Scalability, Temporal 



Scalability, quality scalability as well as Fine Grain Scalability 
[1], [2], [4], [11], [12], [13]. It can cause different decoded 
video due to different frame rate, resolution or different quality. 
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1) Spatial Scalability 

Spatial Scalability is related to resolution of each frame. 
Typically, the base layer includes a reduced resolution of each 
coded frame. The output of base layer encoding is a low 
resolution sequence while encoding both base layer and 
enhancement layers creates a high resolution output. 
Meanwhile, spatial enhancement layers is coding deference 
information among the picture of up scaled reference layer and 
higher resolution of original picture increase a resolution of 
coded pictures [2] 

2) Temporal Scalability 

Temporal scalability can be used to create a high frame 
rate. Generally, it is related to the number of frames per second 
(fps) in video stream. Temporal enhancement layers encode 
additional pictures to increase the frame rate of reproduced 
video. However, it is not allowed to use those pictures as 
reference for spatial enhancement layers [3] 

3) Quality Scalability 

With Quality Scalability, the substream creates a complete 
bitstream with a spatio-temporal resolution and lower fidelity. 
Quality Scalability is often related to signal to noise Ratio 
(SNR) and it is mainly related to visual quality layers of the 
coded video by different bitrates [3]. 
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4) Fine Grain Scalability 
Fine Grain Scalability (FGS) increases the quality of 
sequence in small steps. A FGS application streams video 
through network that may be suitable to scale coded stream 
being match with available bit rate [2]. The main new features 
of SVC are as follow: 



Variable block-size motion-compensated prediction 
with the block size down to 4x4 pixels; 

Quarter-pixel motion vector accuracy; 

Multiple reference pictures for motion compensation; 

Bi-directional predicted picture as a reference for 
motion prediction; 

Intra-picture prediction in the spatial domain; 

Adaptive deblocking filter within the motion- 
compensated prediction loop; 

Small block-size transformation (4x4 block transform); 

Enhanced entropy coding methods: Context- Adaptive 
Variable-Length Coding (CAVLC) and Context- 
Adaptive Binary Arithmetic Coding (CAB AC)). 



TABLE I. 



Comparison of above mentioned CODECS 





MPEG-2 


H.263 


MPEG-4 Visual 


H.264 


SVC 


Data type 


Object base 
coding scheme 


Object base wide 
range of circuit and 
packet switched 
networks 


Rectangular video 
frames and fields 


Rectangular video 
frames and fields 


Rectangular video 
frames and fields 


Compression 
efficiency 


Loss efficiency 


Loss efficiency 


Medium 


High 


High 


Motion 
compensation 
Minimum block 
size 


8x8 


8x8 


8x8 


4x4 


4x4 



III. DISCUSSION 

CODEC'S improvements increase extremely along with 
multimedia developments. Considering above mentioned 
CODECs' capabilities, each CODEC has been adapted by the 
newest version of production at its own time. Therefore, 
MPEG-2 and H.263 stand on object base scheme where loss 
Compression efficiency is another issue. 

In MPEG-4 Visual and H.264 macroblock-based motion 
compensation is the core technology of video coding while 
transformation and quantization of residual data is considered 
[2]. Although, the compression efficiency of MPEG-4 Visual is 
medium in comparison with H.264 and SVC, still it works 
better than both H.263 and MPEG-2 with loss efficiency. 

SVC is able to create high compression efficiency coding 
which is the main requirement of online applications. In case, 
when a lower resolution or bandwidth stream is needed to 
reach the network and a lower performance device is aimed a 
small part of the decoded stream would be sent without any 



further processing. Current small stream would be easier to 
decode and result a video with low resolution. In this case, the 
encoded video stream would adapt itself to the bandwidth of 
the transport channel and to requirements of target device. This 
is the characteristic of Scalable Video CODEC. 

IV. CONCLUSION 

Although traditional systems for transmitting video data 
may have some scalable capability, but they still have some 
challenges due to less coding efficiency and complex decoder. 
Furthermore, different types of scalability may be combined 
and create large number of representations with different 
spatio- temporal resolutions. In this case, one single bit stream 
is able to support the bit rates. However, SVC supports is using 
multiple dimension scalable modes to support flexible 
bitstream. Otherwise, the main problem of SVC is that: how to 
guarantee user perceived quality (UPQ) which means that how 
to guarantee user satisfaction with video quality in video 
services. This is considered as future work [7]. 
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Abstract In this paper, a new method for offline handwritten B. Related works. 



signature retrieval is based on curvelet transform is proposed. 
Many applications in image processing require similarity 
retrieval of an image from a large collection of images. In such 
cases, image indexing becomes important for efficient 
organization and retrieval of images. These papers address this 
issue in the context of a database of handwritten signature images 
and describes a system for similarity retrieval. The proposed 
system uses a curvelet based texture features extraction .The 
performance of the system has been tested with an image 
database of 180 signatures. The results obtained indicate that the 
proposed system is able to identify signatures with great with 
accuracy even when a part of a signature is missing. 

Keywords- Handwritten recognition, Image indexing, Similarity 
retrieval, Signature verification, Signature identification. 



I. 



Motivation 



Introduction (Heading 1) 



A signature appears on many types of documents such as 
bank cheques in daily life and credit slips, thus signature has a 
great importance in a person's life. Automatic bank cheque 
processing is an active topic in the field of document analysis 
and processing. Signature validity confirmation of different 
document is one of the important problems in automatic 
document processing. Now a days, person identification and 
verification are very important in security and resource access 
control. For this purpose the first and simple way is to use 
Personal Identification Number (PIN), but PIN code may be 
forgotten. Now an interesting method to identification and 
verification is biometric approach [1]. Biometric is a measure 
for identification that is unique for each person. Always 
biometric is together with person and cannot be forgotten. In 
addition biometric usually cannot be misused. 

Handwritten signature retrieval is still a challenging work 
in the situations of a large database. Unlike fingerprint palm 
print and iris, signatures have significant amount of intra class 
variations making the research even more compelling. This 
approach with the potential applications of signature 
recognition/verification system optimized with efficient 
signature retrieval mechanism. 



Signature verification contain two areas: off-line signature 
verification ,where signature samples are scanned into image 
representation and on-line signature verification, where 
signature samples are collected from a digitizing tablet which 
is capable of pen movements during the writing .In our work, 
we survey the offline signature identification and retrieval . In 
2009, Ghandali and Moghaddam have proposed an off-line 
Persians signature identification and verification based on 
Image registration, DWT (Discrete Wavelet Transform) and 
fusion. They used DWT for features extraction and Euclidean 
distance for comparing features. It is language dependent 
method [1]. In 2008, Larkins and Mayo have introduced a 
person dependent off-line signature verification method that is 
based on Adaptive Feature Threshold (AFT) [2]. AFT 
enhances the method of converting a simple feature of 
signature to binary feature vector to improve its representative 
similarity with training signatures. They have used 
combination of spatial pyramid and equimass sampling grids 
to improve representation of a signature based on gradient 
direction. In classification phase, they used DWT and graph 
matching methods. In another work, Ramachandra et al [3], 
have proposed cross-validation for graph matching based off- 
line signature verification (CSMOSV) algorithm in which 
graph matching compares signatures and the Euclidean 
distance measures the dissimilarity between signatures. 
In 2007, Kovari et. al presented an approach for off-line 
signature verification, which was able to preserve and take 
usage of semantic information[4].They used position and 
direction of endpoints in features extraction phase. Porwik [5] 
introduced a three stages method for offline signature 
recognition. In this approach the hough transform , center of 
gravity and horizontal-vertical signature histogram have been 
employed, using both static and dynamic features that were 
processed by DWT has been addressed in[6].The verification 
phase of this method is based on fuzzy net using the enhanced 
version of the MDF(Modified Direction feature)extractor has 
been presented by Armand et.al [7]. The different neural 
classifier such as Resilient Back Propagation(RBP), Neural 
network and Radial Basis Function(RBF) network have been 
used in verification phase of this method. In 1995, Han and 
Sethi [8], described offline signature retrieval and use a set of 
geometrical and topological features to map a signature onto 
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2D strings. We have proposed an offline signature retrieval 
model based on global features. 

The main contribution of this paper is that, we have proposed 
off-line handwritten signature retrieval using curvelet 
transform, In retrieval phases Canberra distance measure is 
used. The experimental results of proposed method were 
satisfactory and found that it had better results compare with 
related works. The rest of paper is organized as follows: In 
section II, discusses the feature extraction phase. The signature 
retrieval is presented in section III. In section IV, the 
experimental results and finally section V concludes the 
work. 

II. FEATURE EXTRACTION PHASE 

The major task of feature extraction is to reduce image data to 
much smaller amount of data which represents the important 
characteristic of the image. In signature retrieval, edge 
information is very important in characterizing signature 
properties. Therefore we proposed to use the curvelet 
transform. The performance of the system is compared with 
standard discrete wavelet transform which captures 
information in only three directions. 

A. Discrete Wavelet Transform 

The multi resolution wavelet transform decomposes a 
signal into low pass and high pass information. The low pass 
information represents a smoothed version and the main body 
of the original data. The high pass information represents data 
of sharper variations and details. Discrete Wavelet Transform 
decomposes the image into four sub-images when one level of 
decomposing is used. One of these sub-images is a smoothed 
version of the original image corresponding to the low pass 
information and the other three ones are high pass information 
that represents the horizontal, vertical and diagonal edges of the 
image respectively. When two images are similar, their 
difference would be existed in high-frequency information. A 
DWT with N decomposition levels has 3N+1 frequency bands 
with 3N high-frequency bands [9], [10]. The impulse responses 
associated with 2-D discrete wavelet transform are illustrated in 
Fig. 1 as gray- scale image. 
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represent an edge to squared error 1/N requires 1/N wavelets. 
The curvelet transform, like the wavelet transform, is a 
multiscale transform, with frame elements indexed by scale and 
location parameters. Unlike the wavelet transform, it has 
directional parameters, and the curvelet pyramid contains 
elements with a very high degree of directional specificity. In 
addition, the curvelet transform is based on a certain 
anisotropic scaling principle which is quite different from the 
isotropic scaling of wavelets. The elements obey a special 
scaling law, where the length of the support of a frame 
elements and the width of the support are linked by the relation 
width ~ length 2 . see details in [1 1]. 

C. Feature Database Creation 

To construct the feature vectors of each handwritten 
signature in the database using DWT and curvelet transform 
respectively. The Energy and Standard Deviation (STD) were 
computed separately on each sub band and the feature vector 
was formed using these two parameter values. The Energy 




Fig. 1. Impulse response of °, 90° and ± 45° of DWT 



B. 



Curvelet Transform 

Recently, Candies and Donoho developed a new 
multiscale transform which they called the curvelet transform. 
Motivated by the needs of image analysis, it was nevertheless 
first proposed in the context of objects f(xl, x2) defined on the 
continuum plane (xl, x2) € R 2 . 

The transform was designed to represent edges and other 
singularities along curves much more efficiently than 
traditional transforms, i.e. using many fewer coefficients for a 
given accuracy of reconstruction. Roughly speaking, to 
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III. OFFLINE Handwritten SIGNATURE Retrieval 
PHASE 

There are several ways to work out the distance between 
two points in multidimensional space. The most commonly 
used is the Canberra distance measure. It can be considered the 
shortest distance between two points. We have used Canberra 
distance metric as similarity measure. If x and y are the feature 
vectors of the database and query signature, respectively, x and 
y have dimension d, then the Canberra distance is given by 



d 

Canb (x, y) — Y, 
i=l 
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(4) 
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Query image 



Algorithm 1 : Offline Handwritten Signature Retrieval 
Input: Test signature: St 

Feature database: FV 
Output: Distance vector: Dist 

Handwritten signature retrieval 
Begin 

Calculate feature vector of test signature using 
DWT and curvelet transform 
For each fv in FV do 

Dist= Calculate distance between test signature 
and fv using (4) 
sort Dist 
End for 

Display the top signature from dist vector. 
End 
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Fig.2. Sample Handwritten Signature Images Database 

B. Retrieval Performance 

For each experiment, one image was selected at random as the 
query image from each writer and thus retrieved images were 
obtained. For performance evaluation of the signature image 
retrieval system, it is significant to define a suitable metric. 
Two metrics are employed in our experiments as follows. 

Number of relevant signatures retrieved 

Recall = — - - - ; (5) 

Number of relevant signatures 



IV. EXPERIMENTAL RESULTS 

A. Image Database 

The signatures were collected using either black or blue ink 
(No pen brands were taken into consideration), on a white A4 
sheet of paper, with eight signature per page. A scanner 
subsequently digitized the eight signatures, contained on each 
page, with a resolution in 256 grey levels. Afterwards the 
images were cut and pasted in rectangular areas of size 
256x256 pixels. Sample signature database for 16 persons are 
shown in Fig.2. A group of 16 persons are selected for 12 
specimen signatures which make the total of 16x12=192 
signature database. 
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Number of relevant signatures retrieved 
Precision = (6) 

Number of signatures retrieved 

Results correspond to precision and recall rate for a Topi, Top 
2, Top 5, Top 8, Top 10, and Top 12. The comparative 
retrieval performance of the proposed system is shown in 
Table 1. 

Tablel: Average Retrieval Performance 





Discrete wavelet 
Transform 


Curvelet Transform 


Number of 
Top matches 


Precision 

% 


Recall 

% 


Precision 

% 


Recall 

% 


Topi 


100 


8 


100 


8 


Top 2 


80 


12.6 


96.6 


15.4 


Top 5 


66.7 


28.9 


92 


36.7 


Top 8 


55.8 


37.2 


73.3 


48.5 


Top 10 


51.3 


43.4 


70.7 


59.0 


Top 12 


47.8 


47.5 


66.04 


65.2 



Fig.2. Sample Signature Images Database 



Retrieval performance of the proposed method is compared 
using DWT transform technique. We evaluated the 
performance in terms of average rate of retrieving images as 
function of the number of top retrieved images. Fig.3 shows 
graph illustrating this comparison between DWT and curvelet 
transform according to the number of top matches considered 
for database. From Fig. 3, it is clear that the new method is 
superior to DWT. To retrieve images from the database those 
have a similar writing style to the original request. In Fig. 4, 
retrieval example results are presented in a list of images 
having a query image. 
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Fig. 3. Comparative average retrieval rate using DWT and Curvelet transform 
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Fig. 4. Sample handwritten signature retrieval example 

V. CONCLUSION 

Experimental were conducted for quick for retrieval of offline 
signature and result are presented. The retrieval performance 
of the proposed method based on edge correspondence is 
compared with the retrieval method based on DWT. The 
proposed method is simple, efficient and outperforms the 
retrieval system based on curvelet features respect to all 
parameters (Precision, Recall and Correct retrieval). The 
proposed approach used curvelet features for extracting details 
and Canberra distance for comparing features. 
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Abstract — Long term evolution (LTE) is designed for high 
speed data rate, higher spectral efficiency, and lower latency 
as well as high-capacity voice support. LTE uses single carrier- 
frequency division multiple access (SC-FDMA) scheme for the 
uplink transmission and orthogonal frequency division multiple 
access (OFDMA) in downlink. The one of the most important 
challenges for a terminal implementation are channel estimation 
(CE) and equalization. In this paper, a minimum mean square 
error (MMSE) based channel estimator is proposed for an 
OFDMA systems that can avoid the ill-conditioned least square 
(LS) problem with lower computational complexity. This channel 
estimation technique uses knowledge of channel properties to 
estimate the unknown channel transfer function at non-pilot sub- 
carriers. 

Index Terms — Channel estimation, LTE, least-square, 
OFDMA, SC-FDMA. 

I. Introduction 

The 3rd generation partnership project (3 GPP) members 
started a feasibility study on the enhancement of the universal 
terrestrial radio access (UTRA) in December 2004, to improve 
the mobile phone standard to cope with future requirements. 
This project was called evolved-UTRAN or long term evolu- 
tion [1], [22]. The main purposes of the LTE is substantially 
improved end-user throughputs, low latency, sector capacity, 
simplified lower network cost, high radio efficiency, reduced 
user equipment (UE) complexity, high data rate, and signifi- 
cantly improved user experience with full mobility [2]. 

3 GPP LTE uses orthogonal frequency division multiplexing 
access (OFDMA) for downlink and single carrier-frequency 
division multiple access (SC-FDMA) for uplink. SC-FDMA 
is a promising technique for high data rate transmission 
that utilizes single carrier modulation and frequency domain 
equalization. Single carrier transmitter structure leads to keep 
the peak-to average power ratio (PAPR) as low as possible that 
will reduced the energy consumption. SC-FDMA has similar 
throughput performance and essentially the same overall com- 
plexity as OFDMA [1]. A highly efficient way to cope with 
the frequency selectivity of wideband channel is OFDMA. It 
is an effective technique for combating multipath fading and 
for high bit rate transmission over mobile wireless channels. 
In OFDMA system, the entire channel is divided into many 
narrow subchannels, which are transmitted in parallel, thereby 



increasing the symbol duration and reducing the intersymbol- 
interference (ISI) [2], [4]. Channel estimation (CE) plays an 
important part in LTE OFDMA systems. It can be employed 
for the purpose of detecting received signal, improving the 
capacity of OFDMA systems by cross-layer design, and im- 
proving the system performance in terms of symbol error 
probability (SEP) [4], [5]. 

A key aspect of the wireless communication system is 
the estimation of the channel and channel parameters. CE 
has been successfully used to improve the performance of 
LTE OFDMA systems. It is crucial for diversity combination, 
coherent detection, and space-time coding. Improved channel 
estimation can result: improved signal-to-noise ratio, channel 
equalization, co-channel interference (CCI) rejection, mobile 
localization, and improved network performance [1], [2], [3], 
[18]. 

Many CE techniques have been proposed to mitigate inter- 
channel interference (ICI) in the downlink direction of LTE 
systems. In [3], the LS CE has been proposed to minimize 
the squared differences between the receive signal and es- 
timation signal. The LS algorithm, which is independent of 
the channel model, is commonly used in equalization and 
filtering applications. But the radio channel is varying with 
time and the inversion of the large dimensional square matrix 
turns out to be ill-conditioned. In [19], Wiener filtering based 
two-dimensional pilot- symbol aided channel estimation has 
been proposed. Although it exhibits the best performance 
among the existing linear algorithms in literature, it requires 
accurate knowledge of second order channel statistics, which 
is not always feasible at a mobile receiver. This estimator 
gives almost the same result as ID estimators, but it requires 
higher complexity. To further improve the accuracy of the 
estimator, Wiener filtering based iterative channel estimation 
has been investigated [4]. However, this scheme also require 
high complexity. 

In this paper we proposed a channel estimation method 
in the downlink direction of LTE systems. This proposed 
method uses knowledge of channel properties to estimate the 
unknown channel transfer function at non-pilot sub-carriers. 
These properties are assumed to be known at the receiver for 
the estimator to perform optimally. The following advantages 
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Fig. 1. OFDM transceiver system model. 



will be gained by using this proposed method. Firstly, the 
proposed method avoids ill-conditioned problem in the inver- 
sion operation of a large dimensional matrix. Secondly, the 
proposed method can track the changes of channel parameters, 
that is, the channel autocorrelation matrix and SNR. However, 
the conventional LS method cannot track the channel. Once 
the channel parameters change, the performance of the conven- 
tional LS method will degrade due to the parameter mismatch. 
Finally, the computational complexity of the proposed method 
is significantly lower than existing LS and Wiener CE method. 

We use the following notations throughout this paper: 
bold face lower and upper case letters are used to represent 
vectors and matrices, respectively. Superscripts x* denote the 
conjugate transpose of the complex vector x, diag(x) is the 
diagonal matrix that its diagonal is vector x; and the symbol 
E(.) denotes expectation. 

The remainder of the paper is organized as follows: sec- 
tion II describes LTE OFDMA system model. The proposed 
channel estimation scheme is presented in section III, and its 
performance is analyzed in section IV. Section V concludes 
the work. 

II. System description 

A. System model 

A simplified block diagram of the LTE OFDMA transceiver 
is shown in Fig.l. At the transmitter side, a baseband modu- 
lator transmits the binary input to a multilevel sequences of 
complex number ra(n) in one of several possible modulation 
formats including binary phase shift keying (BPSK), quandary 
PSK (QPSK), 8 level PSK (8PSK), 16-QAM, and 64-QAM 
[1]. CE usually needs some kind of pilot information as a point 
of reference. CE is often achieved by multiplexing known 
symbols, so called, pilot symbols into data sequence [15]. 
These modulated symbols, both pilots and data, are perform a 
N-point inverse discrete Fourier transform (IDFT) to produce 
a time domain representation [1]: 



s(ra) 



1 



N-l 

y m(n)e J 

n=0 



(1) 



where m is the discrete symbols, n is the sample index, and 
ra(n) is the data symbol. The IDFT module output is followed 
by a cyclic prefix (CP) insertion that completes the digital 



stage of the signal flow. A cyclic extension is used to eliminate 
intersymbol-interference (ISI) and preserve the orthogonality 
of the tones. 

B. Channel model 

Channel model is a mathematical representation of the 
transfer characteristics of the physical medium. These mod- 
els are formulated by observing the characteristics of the 
received signal. According to the documents from 3GPP [15], 
in the mobile environment, a radio wave propagation can 
be described by multipaths which arise from reflection and 
scattering. If there are L distinct paths from transmitter to 
the receiver, the impulse response of the wide-sense station- 
ary uncorrelated scattering (WSSUS) fading channel can be 
represented as [4]: 



L-l 



w ( r ^) = ^2 w i( t ) s ( r - r i)^ 



(2) 



1=0 



where fading channel coefficients wi(t) are the wide sense 
stationary i.e. wi(t) = w(m,l), uncorrelated complex Gaus- 
sian random paths gains at time instant t with their respective 
delays 77, where w(m,l) is the sample spaced channel re- 
sponse of the Ith path during the time m, and S(.) is the Dirac 
delta function. Based on the WSSUS assumption, the fading 
channel coefficients in different delay taps are statistically 
independent. Fading channel coefficient is determined by the 
cyclic equivalent of sinc-fuctions [7]. In time domain fading 
coefficients are correlated and have Doppler power spectrum 
density modeled in Jakes [13] and has an autocorrelation 
function given by [5]: 



E[w(m, l)w(n, Z)f] = cr^(/)rt(ra — n) 
= a 2 w (l)J [27rf d T f (m-n)i 



(3) 



where w(n,l) is a response of the Ith propagation path 
measured at time n, cr^(Z) denotes the power of the channel 
coefficients, fd is the Doppler frequency in Hertz, Tf is 
the OFDMA symbol duration in seconds, and Jn(.) is the 
zero order Bessel function of the first kind. The term f^Tf 
represents the normalized Doppler frequency [5]. 

C. Received signal model 

At the receiver, the opposite set of the operation is per- 
formed. We assume that the synchronization is perfect. Then, 
the cyclic prefix samples are discarded and the remaining N 
samples are processed by the DFT to retrieve the complex 
constellation symbols transmitted over the orthogonal sub- 
channels. The received signal can be expressed as [5]: 



L-l 



r(m) = y^ w(m, l)s(m — I) + z(m), 



(4) 



1=0 



where s(m — I) is the complex symbol drawn from a con- 
stellation s of the Ith paths at time m — I, and z(m) is the 
additive white Gaussian noise (AWGN) with zero mean and 
variance x. After DFT operation, the received signal at pilot 
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locations is extracted from signal and the corresponding output 
is represented as follows: 

M-\ 

R(k) = 2_^ r(m)e M 



ra=0 
M-\ 



2_. [w(m, l)s(m — I) + z(m)]e 



(5) 



ra=0 



The received signals are demodulated and soft or hard values 
of the corresponding bits are passed to the decoder. The 
decoder analyzes the structure of received bit pattern and 
tries to reconstruct the original signal. In order to achieve 
good performance the receiver has to know the impact of the 
channel. 

D. OFDMA waveform 

The frequencies (sub-carriers) are orthogonal, meaning the 
peak of one sub-carrier coincides with the null of an adjacent 
sub-carrier. With the orthogonality, each sub-carrier can be 



N sub-carriers 



Spacing 1/T 




Fig. 2. Orthogonal overlapping spectral shapes for OFDMA system. 

demodulated independently without ICI. In OFDM system, 
the entire channel is divided into many narrow sub-channels, 
which are transmitted in parallel, thereby increasing the sym- 
bol duration and reducing the ISI. 

Like OFDM, OFDMA employs multiple closely spaced sub- 
carriers, but the sub-carriers are divided into groups of sub- 
carriers. Each group is named a sub-channel. The sub-carriers 
that form a sub-channel need not be adjacent. In the downlink, 
a sub-channel may be intended for different receivers. Finally, 
OFDMA is a multi-user OFDM (single user) that allows 
multiple access on the same channel. Despite many benefits 
of OFDMA for high speed data rate services, they suffer from 
high envelope fluctuation in the time domain, leading to large 
PAPR. Because high PAPR is detrimental to user equipment 
(UE) terminals, SC-FDMA has drawn great attention as an 
attractive alternative to OFDMA for uplink data transmission. 

III. CE PROCEDURE 

CE is the process of characterizing the effect of the phys- 
ical medium on the input sequence. The aim of most CE 
algorithm is to minimize the mean squared error (MSE), 
while utilizing as little computational resources as possible 
in the estimation process [2], [4]. CE algorithms allow the 
receiver to approximate the impulse response of the channel 



and explain the behavior of the channel. This knowledge 
of the channel's behavior is well-utilized in modern mobile 
radio communications. One of the most important benefits 
of channel estimation is that it allows the implementation of 
coherent demodulation. Coherent demodulation requires the 
knowledge the phase of the signal. This can be accomplished 
by using channel estimation techniques. Once a model has 

One radio frame = 20 Slots =10 Sub-frames = 10 ms 



1 Slot = 7 OFDM symbols = 0.5 ms 



2 Slots = 1 Sub-frame = 10 ms 
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Fig. 3. OFDMA generic frame structure. 

been established, its parameters need to be estimated in order 
to minimize the error as the channel changes. If the receiver 
has a priori knowledge of the information being sent over the 
channel, it can utilize this knowledge to obtain an accurate 
estimate of the impulse response of the channel. 

In LTE, like many OFDMA systems, known symbols called 
training sequence, are inserted at specific locations in the time 
frequency grid in order to facilitate channel estimation [10], 
[15]. As shown in Fig. 3, each slot in LTE downlink has a 
pilot symbol in its seventh symbol [6] and LTE radio frames 
are 10 msec long. They are divided into 10 subframes, each 
subframe 1 msec long. Each subframe is further divided into 
two slots, each of 0.5 msec duration. The subcarrier spacing in 
the frequency domain is 15 kHz. Twelve of these subcarriers 
together (per slot) is called a physical resource block (PRB) 
therefore one resource block is 180 kHz [2], [3], [6]. Six 
resource blocks fit in a carrier of 1.4 MHz and 100 resource 
blocks fit in a carrier of 20 MHz. Slots consist of either 6 
or 7 ODFM symbols, depending on whether the normal or 
extended cyclic prefix is employed [10], [15], [17]. 

Channel estimates are often achieved by multiplexing train- 
ing sequence into the data sequence [18]. These training 
symbols allow the receiver to extract channel attenuations and 
phase rotation estimates for each received symbol, facilitating 
the compensation of channel fading envelope and phase. Gen- 
eral channel estimation procedure for LTE OFDMA system is 
shown in Fig. 4. The signal S is transmitted via a time- varying 
channel w, and corrupted by an additive white Gaussian noise 
(AWGN) z before being detected in a receiver. The reference 
signal West is estimated using LS , Wiener based, or proposed 
method. In the channel estimator, transmitted signal S is 
convolved with an estimate of the channel w est . The error 
between the received signal and its estimate is 



e = (r-ri). 



(6) 
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Fig. 4. General channel estimation procedure. 



The aim of most channel estimation algorithms is to minimize 
the mean squared error (MMSE), while utilizing as little 
computational resources as possible in the estimation process. 
The equation (4) can be written as vector notation as [1]: 



Sw + z, 



(7) 



where r = (r ,ri, ,r L -i)\ S = diag(s ,s 1 , ,sl-i) 

, w = (w ,wi, ,w L -iY, and z = (20,21, ,^L-i) f . 

The least-square estimate of such a system is obtained by 
minimizing square distance between the received signal and 
its estimate as [3]: 



J = (Sr - w) 2 = (r - Sw)(r - Sw) f . 



(8) 



We differentiate this with respect to wf and set the results 
equal to zero to produce [3]: 



w LS = (al + SS^-^r, 



(9) 



where a is regularization parameter and has to be chosen such 
that the resulting eigenvalues are all defined and the matrix 
(al + SSf) -1 is the least perturbed. Where the channel is 
considered as a deterministic parameter and no knowledge on 
its statistics and on the noise is needed. The LS estimator is 
computationally simple but problem that is encountered in the 
straight application of the LS estimator is that the inversion 
of the square matrix turns out to be ill-conditioned. So, we 
need to regularize the eigenvalues of the matrix to be inverted 
by adding a small constant term to the diagonal [3]. If the 
transmitted signal is more random, the performance of the LS 
method is significantly decrease. Also the LS estimate of w est 
is susceptible to Gaussian noise and inter-carrier interference 
(ICI). Because the channel responses of data subcarriers are 
obtained by interpolating the channel responses of pilot sub- 
carriers, the performance of OFDM system based on comb- 
type pilot arrangement is highly dependent on the rigorousness 
of estimate of pilot signals. The successful implementation of 
the LS estimator depends on the existence of the inverse matrix 
(SSf) -1 . If the matrix (SSf) is singular (or close to singular), 
then the LS solution does not exist (or is not reliable). 
To improve the accuracy of the estimator, Wiener filtering 
based iterative channel estimation has been investigated [4], 
[7]: 



w est = R^F t S t [(SFR^F t S t ) + x^ 1 ^. 



(10) 



where R ww is the autocovariance matrix of w, F is the 
DFT matrix, and x denotes the noise variance. However, this 
scheme also requires higher complexity. 



IV. Proposed MMSE based CE technique 
The equation (7) can we rewritten as [22]: 



Wi 



r z 

= S + S 
w 2 +zi, 



(11) 



where actual channel value is W2 = r/S, noise contribution 
zi = z/S, and Wi is the result of direct estimated channel. 
The proposed channel estimation is 

L-l 

Wprop = ^a^wi(fc) 

k=0 
L-l 

= 5>+[w 2 (fc)+zi(*)] 

k=0 



a.w 3 , 



(12) 



where a& = (ao,ai...,a L -i)"'" is the column vector filter 



^L-l 



coefficients, and W3 = J2k=ol w z(k) + zi(fc)]. The mean 
square error (MSE) for the proposed LTE channel estimation is 
J = (w — yv prop ) 2 . In order to calculate the optimal coefficient, 
taking the expectation of MSE and partial derivative with 
respect to channel coefficient: 



dE(J) _ _d_ 
<9at <9at 



(E[(W - W prop )(w - W prop y}). (13) 



Now putting the value of w prop = a T W3 into the above 
equation to produce: 



dE(J) _ _9_ 

<9at <9at 
_ _9_ 

~ Sat 



(E[(w — a t w 3 )(w — a" 1 " W3 )"•"]) 
(E[(w-a t w 3 )(w t -awj)]) 



flat 



(E[ww^ — a^w3wt — awgW + a^wsw^a]) 

= E[— w 3 wt + w 3 wja]. (14) 



Now putting the partial derivative equal to zero in the above 
equation and after some manipulations we get the coefficient 
as: 

a = E[(w 3 wt)](E[(w3W^)])- 1 

= [E(w 2 + zi)w f ][E((w2 + zi)(w 2 + zi) 1 ")]" 1 

= [£(w 2 w f + Zl w f )] [£((w 2 + Zl )(w£ + zj))]- 1 

= ^(w 2 w f + z i w t )[^(w 2 w^ + ZiW^ + w 2 z{ + ziz{)] _1 . 

(15) 

In this paper we assume that mean of the AWGN is zero i.e. 
E(z) = and variance is x i.e. E(zz^) = x. So, the above 
equation is simplified as: 

a = ^(w 2 w t )[E(w 2 w^) + ^(zizj)]- 1 

= £(w 2 w f )[£(w 2 wi>) + x}' 1 

= Across * (Wauto + x) -1 , (16) 
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where w cross = E(\V2^) and W auto = E(\V2w 2 ) are the 
channel cross-correlation vector and autocorrelation matrix re- 
spectively. Now putting this filter coefficient value in equation 
(12), we get the final channel estimation formula as: 

-ii 



Wt, 



Wr. 



*(W. 



auto 



-x) ]w 3 . 



(17) 



V. Complexity Comparison 

The complexity of CE is of crucial importance especially for 
time varying wireless channels, where it has to be performed 
periodically or even continuously. For this proposed estimator, 
the main contribution to the complexity comes from the term 
[w cross * (wauto + ^) -1 ]- The variance of the AWGN is pre- 
calculated and added with the autocorrelation matrix. Thus, 
only one run-time matrix inversion is required. Also W3 is 
pre-calculated column vector. Table I summarizes the com- 
putational complexity of the proposed and existing channel 
estimation methods. It shows that the proposed CE algorithm 
has lower complexity than existing methods. 

TABLE I 

COMPUTATIONAL COMPLEXITY OF ALGORITHMS 



Operation 



LS method Wiener method Proposed method 



Matrix inversion 

Multiplication 

Addition 



VI. Conclusion 

In this paper, we present a MMSE based channel estima- 
tion method for LTE OFDMA systems and compared the 
performance with the LS and Wiener based filtering method. 
This proposed channel estimation method uses knowledge of 
channel properties to estimate the unknown channel transfer 
function at non-pilot sub-carriers. It can well solve the ill- 
conditioned least square (LS) problem and track the changes 
of channel parameters with low complexity . 
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Abstract — RTCP has limitation with scalability for large 
streaming sessions; because of the limitation of the bandwidth 
space that given to RTCP reports. Many researchers studied and 
still studying how to solve this limitation, and most of the 
researchers come out with tree structure as a solution but in a 
different ways. 

Keywords- RTCP/RTP; Scalability; Large Streaming Sessions 

I. Introduction 

Communication is everywhere in our life and as a part of it 
is multimedia communication like, VoIP, multimedia 
conferencing, teleconferencing, video surveillance, satellite 
communication, etc. Multimedia communication is mainly 
using Real time protocol (RTP) that works together with Real 
time control protocol (RTCP) to do transfer data that depend on 
real time like video or audio over networks. RTCP is used to 
monitor RTP-Packets and reports feedback [1]. The main 
function of RTCP is to transmit periodically the sender and 
receiver reports to all members in RTP/RTCP. These reports 
allow a host to know if a problem exists or not and if the 
problem is local or global [2]. RTCP like other protocols and 
techniques facing a lot of problems, one of the important 
problems is RTCP scalability. [3] Increasing the number of 
hosts in RTP/RTCP sessions caused some problems under the 
name of the scalability problems; the problems can be the 
feedback delay, storage problem, flood of initial/bye RTCP 
reports, etc. 

II. Scalability of RTCP 

RTP is a real time transmission protocol of audio and video, 
which provide several functions that help the transmission of 
audio and video such as [4]. Identification of payload data type, 
give a Sequence numbering to detect packet loss and to order 
packets, Time stamping so that data is played out at the right 
speeds [5]. RTCP is the attached protocol to RTP, and its 
working by sending/receiving reports, and it has several kinds 
of reports the main reports in RTCP are the sender report (SR) 
and receiver report (RR). Both include performance statistics 
on the total number of packets lost since the beginning of 
transmission, the fraction of packet loss in the interval between 
sending this feedback report and sending the previous one, the 
highest sequence number received, jitter, and other delay 
measurements to calculate the round-trip feedback delay time. 



The SR provides more statistics summarizing data transmission 
from the sender, e.g. timestamps, count of RTP data packets, 
and number of payload octet's transmitted [4]. 

• RRs are used mainly in sender-based adaptive 
applications (The packet loss parameter in the RRs has 
been used as an indicator of congestion in the 
network). 

• The SR is useful in lip-synchronization (inter-media 
synchronization) and in calculating transmission bit 
rates. 

What does scalability mean when it uses with RTCP term, 
increase the number of hosts in one session then increase the 
number of RTCP report, which means RTCP scalability. RTCP 
is kind of reports' protocol that sends/receive reports from/to 
hosts in one session. These reports limited to bandwidth size, 
RTCP given 5% from the whole session bandwidth size. RTCP 
has two types of the report: Sender report and Receiver report. 
Sender report uses 75% of RTCP bandwidth size while 
Receiver report uses 25%. The limitation of bandwidth size of 
RTCP makes it control the interval of sending receiving 
reports, increasing the number of hosts caused to increase the 
interval of sending receiving time and that makes the reports 
useless [6]. 

III. Challenges of RTCP Scalability 

Many researchers studied and wrote about RTCP problems, 
and the most important problems are about the scalability of 
RTCP and reports' feedback in a large streaming session. The 
scalability in RTCP faces many problems when it comes to a 
group of thousands of users. Some of these problems are 
addressed in [2 ... 11]. The first serious study done by El- 
Merakby, with three main researches 1998 "A Scalability 
Scheme for the Real-time Control Protocol", 2000 "Design and 
Performance of a Scalable Real Time Control Protocol: 
Simulations and Evaluations", and 2005 "Scalability 
improvement of the real time control protocol" where she 
studied RTCP problems with scalability and suggested a 
solution by divide the large session group to small session 
groups [4, 8, 9]. 
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A Feedback delay challenge 

One of the important challenges is the feedback delay and 
caused by increasing the group size, because of the limitation 
of the bandwidth size the RTCP reporting interval increased 
which decreases the significance and value of the feedback, and 
then the feedback reports either send rarely or not at all [3, 4, 8, 
9]. 

B. Storage challenge 

The group size could be known if every member stores a 
count of distinct for every member it heard during the session 
using the unique Synchronization Source identifier (SSRC) 
found in the RTP header [3, 4, 8, 9]. 

C. Multicasting RRs to the whole group (bandwidth effect) 

Every member in the session group will multicast RRs 
(Receiver Report) to all other which are not senders and that 
causes a load at every member processing and Congestion will 
happen because of members increase then RR increase also[4, 
8, 9, 10]. 

D. Initial/bye flood challenge 

If many members join/leave the session at the same time, a 
flood of join/Bye packets will happen and congestion in the 
network may occur, especially at members who have low 
bandwidth links [3, 4, 5, 8, 9]. 

El-Merakby tried to explain that the normal case for RTCP 
feedback reports are multicast mainly for receivers to calculate 
the group size and thus compute their RTCP reporting interval, 
and the suggested solution is saying that the members do not 
need to compute the whole size of the multicast group and RRs 
are not multicast, and to divide the big session to many groups. 
The proposed structure is called S-RTCP, and shown in Fig 1, 
explains how members organize dynamically in a multi-level 
hierarchy of local regions. 
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The following are the advantages of using the scheme in 
large RTCP groups [4, 8, 9]: 




Figure 1. Structure of El-Merakby scheme [9]. 

Each region has an aggregator (AG). Each member sends 
the RR feedback to its AG which gathers and aggregates 
statistics from these reports which is passed to a manager or to 
another AG level. Additional statistics are computing by the 
manager to evaluate the transmission quality and to estimate 
the regions which suffer from congestion. Time-to-Live (TTL) 
field in the IP header is used by the scheme to build the multi- 
level hierarchy with locally scoped regions [4, 8, 9]. 



• Resolving the storage scalability problem: Members do 
not need to store the state of the distinct member in the 
group because they are in a different small group size. 

• Timely reporting of feedback reports: Feedback reports 
become more useful because the number of members 
became less. 

• Effective use of the bandwidth: the formation of local 
regions where RRs are not multicast but are sent with 
limited scope and not global scope decrease the number 
of RRs. 

• Decrease in the number of redundant reports: the total 
number of redundant RRs, which used to be multicast, 
is decreased, because the measurements in RRs are 
aggregated into AGRs summarizing the quality of the 
received data. 

In the other side, another researcher was interesting in the 
same area, Julian from University of Cambridge. He published 
an article with title "An Extensible RTCP Control Framework 
for Large Multimedia Distributions" in 2003. Julian thought 
that is two serious challenges with RTCP and they are: the 
growing of using unidirectional and asymmetric broadcast 
architectures, and the second challenge: per-receiver RTCP 
reporting frequency diminishes prohibitively due to the 
bandwidth-sharing algorithm [2]. 

A The growing deployment of unidirectional and 
asymmetric broadcast architectures challenge 

In RTP/RTCP, the data and control share a many-to-many 
communication channel, such as that provided by IP multicast 
[11]. The unidirectional and asymmetric broadcast 
architectures have problems with these issues; instead the 
channel allows not only the bidirectional flow of 
communication from sources to receivers and vice-versa, but 
also direct receiver-to-receiver communication over a single 
channel [2, 11]. 

B. Per-receiver RTCP reporting frequency diminishes 
prohibitively due to the bandwidth-sharing algorithm 

RTCP is keeping the frequency of reports inversely 
proportional to the number of members. And because of that 
RTCP institutes a bandwidth-sharing algorithm that divides the 
resources of the control channel among members' group. The 
standard bandwidth-sharing algorithm used by RTCP expects 
that as groups grow in size, the frequency of individual 
feedback reports will decrease [2]. This problem is the same 
challenge that introduced by El-Merakby, 1998 with a 
challenge title Feedback delay challenge [4]. 

To solve these challenges, two new schemes are devised 
that are complementary to the existing RTCP feedback 
algorithm and influence the unique characteristics of 
summaries to efficiently scale the feedback of the unicast 
backchannel for large groups. The two schemes that influence 
summarization to scale the backchannel: biasing and 
hierarchical aggregation [2]. 
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• The technique of biasing provides preferential treatment 
to the feedback of one or more groups of receivers. 

• The technique of hierarchical aggregation supports the 
existence of multiple summarization nodes throughout a 
collection topology, which distributes the load and 
bandwidth usage of summarization, and in turn lends 
much-needed support to heterogeneous topologies as 
well as to frequency-driven applications. 

In 2005, another researcher was interesting in scalability of 
RTCP, but instead of studying the main RTCP protocol. He 
decided to study S-RTCP (Scalable Real Time Protocol) that 
invented by El-Marakby [4, 8,9], Elramly published two papers 
"Scalability Solutions For Multimedia Real-Time Control 
Protocol (PDPTA'06)" 2005, and "Analysis, Design, and 
Performance Evaluation of MS-RTCP: More Scalable Scheme 
for the Real-Time Control Protocol" 2005. Elramly introduced 
a new protocol MS-RTCP (More Scalable Real Time Control 
Protocol). MS-RTCP scheme is based on a hierarchical 
structure, distributed management, and EL-Marakby scheme. 
The idea of El-Marakby scheme is depending on a tree-based 
hierarchy of report summarizers. The tree leafs (nodes) in the 
session send RRs to some node that acting as AG (aggregator), 
collects and summarizes these reports. The summaries result 
then passed up to the next highest level in the tree, until finally 
they reach the sender or some other appropriate feedback point. 
The summarization scheme is most useful when the nodes at 
one level of a sub-tree see similar network performance. El- 
Marakby uses network hop counts to a summarizer, measured 
through Time To Live (TTL), to group hosts together in the 
tree. She also proposed a dynamic scheme for building the tree 
[4, 8, 9, 12, 13]. The most important introduced problems by 
Elramly to El-marakby scheme (S-RTCP): 
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The MS-RTCP managers join the control multicast group, 
while the MS-RTCP children join the data multicast group. 
Each group of children constructs a region which is controlled 
by a manager. Each child should know its region before sharing 
the RTP session. The Load Balancing Manager (LBM) 
accomplishes this target by testing the real position of each 
child and the real number of children per region. The scheme 
structure is shown in Fig. 2. 



1. 



3. 



4. 



5. 



Fault tolerance is not guaranteed: When any AG is 

crashed or has left the RTP session, all the children in its 

region search for other AG. This will affect the 

convergence time during this interval, and will make an 

old child to a new one. 

Load balancing is not guaranteed: The load balancing 

depends on the maximum number of children with which 

the AGs deals. This is not sufficient, because if we 

suppose that the maximum number of each AG is 100 

children, we may find AG has 90 children and another has 

10 children. 

The structure of the model depends on the central 

processing unit (manager). Hence, if the manager is 

crashed there is no other unit that can take place. The 

model in this case will become unstable (failed in worst 

case). 

Overkill the small groups, which the IP telephone is 

mainly dealing with. This is due to the condition of low 

children number per AG that was put after the model 

evaluation [8]. 

Election of LAN Aggregator (LAG) is not sufficient. 

Source Description (SDES) items [1] about any multicast 

group member will take a long time to access it. 
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Figure 2. General view of Elramly schem [13]. 

The suggested solution and the proposed protocol MS-RTCP: 

1. Fault tolerance guarantee: any control component in the 
MS-RTCP has a spare one. If one or more scheme 
components fail, can replace the failed one with its spare 
one by the decision from the GM (or FTM) until the failed 
one is fixed and the normal situation is re-established. 

2. Load balancing guarantee: the decision for receiving a 
new child in the RTP session depends on the real number 
of children per scheme manager. Consequently, if any 
new child joins a session, it is told with the best manager 
taking in consideration the manager load (real children 
number) and the new child position. 

3. The basic idea of the MS-RTCP is based on the 
management distribution. So, the central management 
processing is eliminated, as the scheme has one manager 
for each management process. 

4. The maximum number of children per manager reaches 
the upper limit at which the RTP session is working safely 
(some hundreds or more). Hence, if a small group joins 
the RTP session, the MS-RTCP will be transformed to the 
simple RTCP view with one manager and it's spare. The 
other MS-RTCP managers will be found, but with 
minimal overhead (neglected values). 
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5. The LAN Manager (LAN-M) has its pre-determined spare 
component. So, in this scheme, no need to elect another 
LAN-M when the basic one fails. 

6. The GM can access any data about any MS-RTCP entity 
by requesting the SDESM (or by CDM in case of any 
problem happened for the SDES-M). 



In Brno University of Technology, Czech Republic, 2007, 
new researchers work with scalability of RTCP. Dan Komosny, 
and Vit Novotny, started to study scalability of RTCP and its 
challenges. They published three papers together, "Tree 
Structure for Source- Specific Multicast with Feedback 
Aggregation" 2007, "Optimization of Large-Scale RTCP 
Feedback Reporting in Fixed and Mobile Networks" 2007, and 
"Large-Scale RTCP Feedback Optimization" 2008. The 
researchers find that RTCP in a large session causes delays in 
sending feedback data from each receiver, and to solve this 
problem they proposed a hierarchical structure and new 
protocol called Tree Transmission Protocol (TTP). Their work 
more to be extended to Julian 2003[2], where they solve some 
of the hierarchical structure that proposed by Julian 2003. The 
Fig. 3 is shown the tree structure [14, 15, 16, 17]. 
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Dan Komosny, and Vit Novotny come out with: 

1. Solve round trip delay by suggestion tree structure. 

2. The problem of the RTCP feedback tree establishment 
was solved. 

3. The method for finding the nearest summarization node in 
the IP network structure was designed. 

4. To manage the tree the new protocol (TTP) was designed 
and specified. 
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(1 Source — 
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Figure 3. Tree topology of the RTCP feedback network [17]. 

To solve the problem of hierarchical structure organization 
new protocol has been proposed TTP (Tree Transmission 
Protocol), this protocol is quite a flexible and robust protocol 
use to organizing the hierarchical tree overlays. It can be used 
for any hierarchically organized protocols. It can work for 
simple hierarchical tree overlays as well as for large-scale 
overlays with many hierarchical levels. The fig 4 shows TTP 
position [15, 16]. 
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Figure 4. Position of TTP to related protocols [16]. 



Shaahin Shahbazi comes with two papers about RTCP 
scalability, "A new design for improvement of scalable-RTCP" 
2009, and "Error Resistant Real-Time Transport Control 
Protocol" 2009. As Elramly 2005, he preferred to deal with El- 
Marakby protocol (Scalable Real Time Control protocol) 
SRTCP. Shahbazi explained about the challenges associated 
with S-RTCP, and proposed different approaches to solve 
the challenges [18, 19, 4, 8 ,9]. 

A The S-RTCP challenges 

• Congestion: If the number of AGs becomes very big 
of AG-0, congestion may occur at the links connected 
to AG-0. 

• Overload: because of the same reason overload may 
happen. 

• Lack of error- tolerance: S-RTCP design is quite 
vulnerable to any sort of malfunctions within AG-0. 

B. Proposed Design for Stability Improvement 

problems are caused due to the singularity of AG-0 in 
S-RTCP 's design and also the fact that no precautions 
have been taken into account in case AG-0 fails [18]. See Fig. 

5. 

• The number of AG-0 nodes: fix the amount of AG-Os 
at two. 

• Mirrored tasks versus split tasks: all AG-Os receive 
the same information. 

• Existence of pre-assigned AG-0: new design will 
allow two or more AG-Os to operate within the 
session, in order to provide stability. 




Normal members 



Figure 5. The architecture of the modified version of S-RTCP [18] 

Shahbazi was designed a new proposed scheme ER-RTCP 
(Error Resistant Real-Time Transport Control Protocol). 
Modifications included designing the multi-manager scheme, 
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improving parent-seeking procedures, reducing distribution of 
request packets, reforming the design to be independent of 
TTL, adding methods to check on sanity of manager nodes. 
This study considered packet loss ratio of below 2% as 
desirable [18, 19]. See Fig. 6. 



IV. 
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COMPARISON BETWEEN RESEARCHERS' WORK 



The comparison can be show in Table 1. 



V. Summary 




RTCP protocol has some problems and many researchers 
studied to solve it specially the scalability issue, the main 
problem was with the limitation of bandwidth space that given 
to RTCP reports, which all the researcher come with tree 
structure as a solution for it but in different ways. 



Figure 6. The architecture of ER-RTCP [19] 



TABLE I. 



COMPARISON BETWEEN RESEARCHERS' WORK 



Researcher 


Period of 
studies 


Problems that solved 


New 
suggested 
protocol 


El-Marakby 


1998, 2000, 
2003, 2005 


1- The research studied RTCP. 

2- Feedback delay. 

3- Increasing storage state. 

4- Multicasting RRs congestion. 

5- Flood (Initial/Bye). 

6- Come out with tree structure. 


S-RTCP 


Julian 
Chesterfield 


2003 


1- The research studied RTCP. 

2- Unidirectional and asymmetric broadcast architectures problem. 

3- Bandwidth- sharing algorithm. 

4- Come out with tree structure. 


- 


El-Ramly 


2005 


1- The research studied S-RTCP. 

2- Fault tolerance is not guaranteed. 

3- Load balancing is not guaranteed. 

4- The structure of the model depends on the central processing unit. 

5- Overkill the small groups. 

6- Election of LAN Aggregator (LAG) is not sufficient. 

7- Source Description (SDES) items. 

8- Come out with tree structure. 


MS-RTCP 


Dan Komosny, 
& Vit Novotny 


2007, 2008 


1- The research studied RTCP. 

2- Round trip delay. 

3- The RTCP feedback tree establishment. 

4- Finding the nearest summarization node in the IP network structure. 

5- Come out with tree structure. 


TTP 


Shaahin 


2009 


1- The research studied S-RTCP. 

2- Congestion. 

3- Overload. 

4- Lack of error-tolerance. 

5- Come out with tree structure. 


ER-RTCP 
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Abstract-Multi-carrier (MC) scheme became a 
promising technique for its spectral efficiency 
and robustness against frequency-selective 
fading. Multi-carrier code division multiple 
access (MC-CDMA) is a powerful modulation 
technique that is being considered in many 
emerging broadband communication systems. 
MC-CDMA combines the advantages of multi- 
carrier modulation with that of code-division 
multiple access (CDMA) to offer reliable high- 
data-rate downlink cellular communication 
services. The MC-CDMA signals are a 
superposition of many narrow-band signals and, 
as a result suffer from strong envelope 
fluctuations which make them very prone to 
nonlinear effects introduced by high power 
amplifier (HPA). HPA introduces conversion in 
both amplitude and phase. In this paper we have 
focused on the signals at the output of the 
nonlinear distorting device. A practical 
technique for determining the bit error rate 
(BER) of downlink MC-CDMA systems using 
binary phase- shift keying (BPSK) modulation 
scheme. The results are applicable to systems 
employing a coherent demodulation with 
maximal ratio combining (MRC) and equal gain 
combining (EGC). 

Keywords- MC-CDMA systems, high power 
amplifiers, nonlinear distortions, maximal ratio 
combining (MRC) y equal gain combining (EGC). 



1. INTRODUCTION 

Future wireless radio networks need to 
make efficient use of the frequency spectrum by 
providing high capacity in terms of number of 
users allowed in the system. Due to the 



advantages of spectrum efficiency, interference 
immunity, high data rate, and sensitivity to 
selective fading channels. Multi-carrier Coded- 
division multiple-access (MC-CDMA) appears to 
be a recommended candidate for future radio 
communication systems. It exploits the 
advantages of spread spectrum and the 
advantages of multi-carrier systems [1]. 

MC-CDMA signals are considered as 
superposition of many narrow-band signals, and 
as a result suffer from strong envelope 
fluctuations which make them very prone to 
nonlinear effects introduced by high power 
amplifiers (HPA s) [2]. 

Power amplifiers (PA's) are vital 
components in many communication system. The 
linearity of a PA response constitutes an 
important factor that ensures signal integrity 
and reliable performance of the communication 
system. High power amplifiers in microwave 
range suffer from the effects of amplitude 
modulation to amplitude modulation distortion 
(AM/AM), and amplitude modulation to phase 
modulation distortion (AM/PM) [3], during 
conversions caused by the HPA amplifiers. These 
distortions can cause intermodulation (IM) 
distortion, which is undesirable to system 
designs. The effects of AM/AM and AM/PM 
distortions degrade the bit error rate 
performance of a communication channel. 

The amplitude and phase modulation 
distortions are minimized using linearization 
method. The linearization method requires 
modeling the characteristics of the amplitude 
distortion and phase distortion of the HPA. 
A Saleh model [4] for traveling wave tube 
(TWT) amplifiers, has been used to provide 
the linearization method and applied to 
measured data from HPA that characterize 
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the distortion caused by the HPA. The 
measured data provides a performance 
curve indicating nonlinear distortion. The 
forward Saleh model is a mathematical 
equation that describes the amplitude and 
phase modulation distortions of the HPA. 

The BER analysis of MC-CDMA based on 
considering different kinds of assumptions, so 
far, have been dedicated in numerous researches 
in advance . 

Performance enhancement of MC-CDMA 
system through, space time trellis code (STTC) 
site diversity with multiple input multiple output 
(MIMO) technique was introduced in [5]. 

A method efficiently suppressing multiple 
access interferences (MAI) in MC-CDMA to 
improve the system capacity was proposed in [6]. 

The performance of fully loaded downlink 
MC-CDMA systems in the presence of residual 
frequency offset (RFO) in multipath Rayleigh 
fading channels with minimum mean square 
error (MMSE) equalizers was presented in [7]. 

The performance analysis of MC-CDMA 
communication systems over Nakagami-m fading 
channels was considered in [8]. 

A downlink MC-CDMA system using binary 
phase-shift keying (BPSK) modulation scheme 
and maximal ratio combining (MRC) in 
frequency-selective Rician fading channels was 
illustrated in [9]. 

The aim of this paper is to analyze the 
influences of the effects of the nonlinear 
distortions introduced by HPA in downlink MC- 
CDMA over Rayleigh fading channel for mobile 
satellite communication systems. The structure 
of this paper is as follows. The basic principles 
model of transmitter system is presented and 
described in more details in section 2. Section 3 
summarizes the HPA baseband models, which is 
most commonly used in mobile satellite 
communication systems. Subsequently in section 
4, the channel model is described. The receiver 
model will be described in section 5. 
Performance analysis of linearized downlink 
MC-CDMA based signal is carried out for both 
EGC and MRC. 



2. MC-CDMA TRANSMITTER 
MODEL 

The input data symbols, a m fkj, are assumed 
to be binary antipodal where k denotes the kth 
bit interval and m denotes the mth user. It is 
assumed that a m [k] takes on values of -1 and +1 
with equal probability. 

As shown in Figure. 1, a single data symbol is 
replicated into N parallel copies. Each branch of 
the parallel stream is multiplied by a chip from a 
spreading code of length N. Each copy is then 
binary phase-shift keying (BPSK) modulated to a 
subcarrier spaced apart from its neighboring 
subcarriers by F/T b Hz where F is an integer 
number. An MC-CDMA signal consists of the 
sum of the outputs of these branches. 

As illustrated in Figure. 1, the transmitted 
signal for MC-CDMA system corresponding to 
the kth data bit of the mth user is [10] 

N-l 

S m (t)= Y,C m [i]a m Ik] • 

i=0 



cos (2nf c t + 2ni (F/T b )t 

Pn(t-kT b ) 

CJi] €{-1,1} 



(1) 



where C m [0], C n [l], , C m [N-l] represent the 

spreading code of the mth user and Pj b (t) is an 
unit amplitude pulse that is non-zero in the 
interval [0,T b ]. 



aJH 



C J°] cosQntt) 



■<»• 



x» — i 

C JN-l] t 0i(2ri [ t+2rf(N.l)lT v l 



I 



uo 



Fig. 1 Transmitter Model of MC-CDMA 
System 
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3. NONLINEARITY EFFECTS ON 
MC-CDMA SIGNAL 

The response of broadband power 
amplifiers can have precarious memory 
effect. The influence of a memory-less 
nonlinearity U(.) can be decomposed into an 
amplitude distortion (AM/AM) and a phase 
distortion (AM/PM), which are both 
functions of the amplitude of the input signal 
to HPA. The complex signal S (t) at the 
output of HPA, can be defined as [11] 

S (t) = U{S m (t)}= A(\S m (t)\). 

ex V (j<P(\S m (t)\))S m (t) (2) 

A[S m (t)] and 4>[S m (t)] are the corresponding 
AM/AM and AM/PM characteristics 
respectively, both dependent exclusively on l/ x , 
which is the input modulus to HPA, they are 
defined as Saleh Model for HPA [12]: 



subsequently results in increasing the bit error 
rate (BER), and the out-of-band energy 
radiation ( spectral spreading ). 




0,4 0.6 0.8 

Normalized Input Magnitude 



Fig. 2. AM/AM and AM/PM characteristics of 
the Saleh model For TWTA HPA' s 



A[U X ] = a a U x I 1 + p a U 2 X 
<P[U x ] = a U x I 1+P*U 2 X 



(3) 



The values of a a , p a , a& and /?# are defined in 
[3]. 

The corresponding AM/AM and AM/PM 
curves so scaled are depicted in Fig. 2. 

While for solid state power amplifier types 
(SSPAs) AM/AM and AM/PM can be defined as 

A[U X ]= U x I [l + (U x /A max ) 2p ] 1/2p 
<P[U X ] = (4) 

A max is the maximum output amplitude, and p 
is a constant controls the smoothness of the 
transition. 



A max =max(A[l/ x ]) = a a A s ll 



(5) 



where A s is the input saturation amplitude 
equals 1 / Vp a 

The HPA operation in the region of its 
nonlinear characteristic causes a nonlinear 
distortion of a transmitted signal, that 



The operating point of HPA is defined by 
input back-off (IBO) parameter which 
corresponds to the ratio of saturated output 
power (P ), and the average input power ( P av ) 
[13] : 

IBO dB = 10 logio(P / Pav) (6) 

The measure of effects due to the nonlinear 
HPA could be decreased by the selection of 
relatively high values of IBO 

The output of HPA defined in Fig. 3, is 
expressed as 



b y = A [ U x ] e 1 



(ax+0fUxJ) 



(7) 



where the input-output functional relation of the 
HPA has been defined as a transfer function. 
Hence in order to obtain linearization, it may be 
necessary to estimate a discrete inverse 
multiplicative function HPA" 1 [.] such that 



b x = b Y . HPA _1 [U Y ] 



(8) 



An alternative expression for the AM/AM 
distortion in (7), convenient for the theoretical 
formulation of the linearizer, is obtained by 
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multiplying the saturation input amplitude As 
in the expression (3). This gives 

A[U X ] = (A\ aa UJ I (A\ + A 2 S p a U 2 J 
A[U X ] = (A 2 S oca UJ I (A 2 S + U 2 ^ (9) 

The theoretical AM/AM inverse transfer 
function A" 1 [.] could be determined by solving (9) 
forl/^AIAMt/J} 



[u]= (A\a./2U) 



\j -\|l- (217/ A s a a f J 



(10) 



Considering the alternative configurations 
shown in Fig. 3, where the same input-output 
function is applied as a pre-distorter [PD] for the 
linearization of the same HPA. Letting \|/[.] 
denote the AM/PM characteristic of the PD 
block. 

For the case of a Pre-distortion, we have [12] : 



b tout = A- 1 [U x U CaK+vtUxl) 
b y = A [a-'IUx] J • 

J(ax+ y[Ux] +0[A-l[Ux] ) 



(ID 



(12) 



57 



PD 



Up out 



HPA 



(A J ,i|/) (A,0) 

Fig. 3. Pre-distortion for HPA Linearization 
The ideal AM/PM correction requires that 



V|/[t/ x ] =-<t>{ A^UJ } 

b pin =A[U x ]<J (ax + 0[UxJ) 
b y = A - 1 { A[U X ]} • 

i(ax+ 0fUxJ +itfA-l[Ux] ) 



(13) 
(14) 

(15) 



Pre-distortion linearization idea, as depicted 
in Fig. 4, can be used to linearize over a wide 



bandwidth. This is achieved by pre-distortion of 
the signal prior to amplification with the inverse 
characteristics of the distortion that will be 
imposed by the power amplifier. Thus the output 
of the HPA is a linear function of the input to the 
predistorter . 



HPA 




J pout 




Fig. 4. Basic System Functional Diagram of Pre- 
distortion Linearization 



A description of the ideal theoretic AM/AM 
and AM/PM inverse characteristics, valid for the 
normalized Saleh's HPA model is shown in Fig. 5 
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Fig. 5. AM/AM and AM/PM pre-distortion for 
the Saleh model 



4. CHANNEL MODEL 

A frequency-selective fading channel with 
1/T b « BW C « F/T b is considered, where BW C is 
the coherence bandwidth. Each modulated 
subcarrier with transmission bandwidth of 1/T b 
does not experience significant dispersion (T b » 
T d ). Doppler shifts are very small, it is also 
assumed that the amplitude and phase remain 
constant over the symbol duration, T b . 



1.C 
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For downlink transmissions, a terminal 
receives interfering signal designated for other 
users (m = 1, 2, ...., M-l) through the same 
channel as the wanted signal (m=0), the transfer 
function of the continuous-time fading channel 
for all transmissions from the base station to user 
m = can be represented as 



H(f c + iF/T b ) = p m4 e 



JK,i 



(16) 



where p mfi , and #„,,,-, are the random amplitude 
and phase of the channel of the mtfi user at 
frequency f c + i (F/T b ). p m j are assumed to be 
independent and identically distributed (IID) 
Rayleigh random variables. The random phases , 
m ,i are assumed to be IID random variables 
uniform on the interval of {0 , 2n} for all users 
and subcarriers. 



5. RECEIVER MODEL 



For M active transmitters, the received signal 
is [10] 



Ml N-l 

r ft) = I I Pm,iC m [i]a m [k] 



m=0 i=0 



COS 



(2nf c t + 2m [F/T b ]t +0 m>i ) + n (t) 



(17) 



where n(t) is additive white Gaussian noise 
(AWGN). The local-mean power at the itfi 
subcarrier of the mth user is defined to be p m ,, = 
E^ )( / 2. Assuming the local-mean powers of the 
subcarriers are equal, the total local-mean power 
of the mth user is equal to p m , = N p m fi . 

As shown in Figure. 6, the first step in 
obtaining the decision variable involves 
demodulating each of subcarriers of the received 
signal, which includes applying a phase 
correction, { , and multiplying the \th subcarrier 
signal by a gain correction, d,. 



— ►»- 

2/Ttfos5fcH 



do 






2.T b cos(2T[f c t+2KFt/Tb + e 1 ) 




'^"^ V >[nTKj- / - 



2ir b cos(2xf c t + 2kF(N-l)tT b + ^_ { ) 

Fig. 6 Receiver Model 

After adding the subcarrier signals together, 
the combined signal is then integrated and 
sampled to yield decision, V . For the kth bit, the 
decision variable is 

M-l N-l 

V =Y J I p m ,iC m [i] d { a m [k]- 

m = i=0 

(k+l)T b 

fcos (2nf c t + 2ttF [i/T b ]t + m , { ) • 



cos (2nf c t + 2nF [i/T b ]t + m ,, )dt + t\ 



(18) 



where the corresponding AWGN term, i/, is 
given as 

N-l (k+l)T b 

9=1 / n(t) (2/T b ) d { • 

i=0 kT b 

cos (2nf c t + 2nF [i/T b ]t + m ,, )dt (19) 

Considering the two standard diversity 
reception techniques: Equal Gain Combining 
(EGC) and Maximum Ratio Combining (MRC) 

With EGC, the gain correction factor at the ith 
subcarrier is given as 



do,i = c [i] 



(20) 
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2- with MRC 



N-l 



V o = a [k] X P,io + fint +n 



(21) 



N-l 



a [k] X P,w represents the desired signal, and 



i=0 



the interference term, fl int > is defined by 

M-l N-l 

fiint = Z Z C « Pi a "> W C Pi /W COS ^"m ,i 
m=0 i*=0 

(22) 

For MRC scheme, the gain correction factor 
at the \th subcarrier is given as 



do,i = p 0i i c [i] 



(23) 



The decision variable for MRC scheme is 
expressed as 



N-l 



V =a [k] Z /io + / ? i"t +9 



(24) 



where, the interference term, fl int > is defined in 
this case by 



M-l N-l 
Pint = Z Z C * Pi a m WQPiPm.i Po.i ™S »'„,,■ 



m=0 /=0 



(25) 



6. PERFORMANCE ANALYSIS 

The downlink BER had been calculated 
as [10] 
1-with EGC 

BER = 



P,T t 



1/2 erfc 7i 



(26) 



4 2(M-1) 



(i-7r/4)p r b +jv„ 



N 



BER = 



PoT b 



1/2 erfc 



(27) 



2(M -1) 



N 



(l-n/4)P„T b +N, 



7. NUMERICAL RESULTS 

A fair measure is given by using the 
normalized minimal signal-to-noise ratio 



SNR,= 101og(P„r 5 /JV )(dB) 



(28) 



which is needed to achieve the wanted BER. T b 
is the equivalent duration for one information 
bit, N is the two sided spectral noise density, 
and P is the given reference power of HP A. The 
SNR can be minimized by optimization of the 
HPA backoff. This becomes more clear, when eq. 
(6) is used in eq. (28) : 



SNR = 10 log (P T b P av /N P av ) 
= 101og(E & /N ) + OBO 



(29) 



The average downlink bit error rate (BER) 
versus the number of interferes are examined. 
For the sake of comparison, the BER for both 
types of diversity, EGC and MRC are illustrated 
under interferers numbers, N = 32, 64, and 128 , 
with SNR = 10 dB in Figures 7, and 8. 

It can be seen that for a small numbers of 
users, MRC outperforms EGC. It was also 
demonstrated the PD effect to mitigate the 
nonlinearity distortions introduced from HPA in 
Fig. 9, and Fig. 10. 
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Fig. 7. BER versus the No. of Interferers for 
EGC case 
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Fig. 9. BER versus the SNR using PD 
for EGC case 



35 



1.e+0 



SNR = 10 dB 



1.e-1 - 



1.e-2 



LiJ 1.e-3 

OQ 



1.e-4 - 



1.e-5 



1.e-6 




50 100 150 

No of Interferers 



• N = 128 
■ N = 64 
A N = 32 



200 



250 



1.e+0 

1.e-1 

1.e-2 

1.e-3 

1.e-4 

1.e-5 

1.e-6 

1.e-7 

1.e-8 - 

1.e-9 - 

1.e-10 

1.e-11 

1.e-12 

1.e-13 

1.e-14 

1.e-15 

1.e-16 








• with PD (m = interferers) 

■ without PD (oBo = 5 dB) (m = interferers) 

A with PD (m = 70 interferers) 

V without PD (oBo = 5 dB) (m = 70 interferers) 



10 



15 
SNR 



20 
dB 



25 



30 



35 



Fig. 8. BER versus the No. of Interferers for 
MRC case 



Fig. 10. BER versus the SNR using PD 
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8. CONCLUSIONS 

In this paper, the downlink transmission in 
MC-CDMA systems with nonlinear HP A of 
transmitter over frequency-selective fading 
channels was considered. This paper presented 
results on a novel modulation, diversity, and 
multiple access technique. 

For two diversity techniques considered, 
MRC performed better than EGC. 

The performance of MC-CDMA would be 
affected by nonlinearities introduced from 
HPA's in the transmitter. 

From previous discussions and plotted results, 
it can be concluded that in order to reduce the 
sensitivity of a MC-CDMA system to the 
nonlinear amplification, it is recommended to 
choose a special technique to mitigate these 
distortions. PD schemes had been selected to do 
this mission, they attended to achieve a 
significantly improve overall system 
performance. 



REFERENCES 

[1] J. long, and Z. Chen, " Theoretical Analysis 
of an MC-CDMA System in Cellular 
Environments" Journal of Science and 
Engineering Technology, Vol. 6, No. 1, pp. 
37-50,2010. 
[2] M. Frikel, S. Safi, B. Targui, and M. Saad," 
Channel Identification Using Chaos for an 
Uplink/Downlink Multicarrier Code Division 
Multiple Access System", JOURNAL OF 
TELECOMMUNICATIONS AND 
INFORMATION TECHNOLOGY, Vol. 1, 
pp. 48-54, 2010. 
[3] M. Chen, and O. Collins, " Multilevel Coding 
for Nonlinear ISI Channels", IEEE 
TRANSACTIONS ON INFORMATION 
THEORY. VOL. 55, NO. 5, MAY 2009. 
[4] J. Gazda, P. Drotar, D. Kocur, and P. 
Galajda," Performance Improvements of 
MC-CDMA Microstatistic Multi-User 
Detection in Nonlinear Fading Channels 
Using Spreading Code Selection," TUBITAK. 
Turk J Elec Engin. VOL. 8, NO. 1, April 
2009. 



[5] N. Kumaratharan, S. Jayapriya, and P. 

Dananjayan, " Performance Enhancement of 
MC-CDMA System through STBC based 
STTC Site Diversity", International Journal 
of Computer and Electrical Engineering, 
Vol. 2, No. 1, February, 2010. 
[6] B.Ness, "EQUAL BER POWER 

CONTROL FOR UPLINK MC-CDMA 
WITH MMSE SUCCESSIVE 
INTERFERENCE CANCELLATION," 
Patent No. US 7,620,096 B2, Nov. 2009. 
[7] P. Reddy, and V. Reddy," BER Degradation 
of MC-CDMA at high SNR with MMSE 
Equalization and Residual Frequency 
Offset," EURASIP Journal on Wireless 
Communications and Networking, Volume 
2009, Article ID 293264,2009. 
[8] J. long, and Z. Chen, " PERFORMANCE 
ANALYSIS OF MC-CDMA 
COMMUNICATION SYSTEMS OVER 
NAKAGAMI-M ENVIRONMENTS", 
Journal of Marine Science and Technology, 
Vol. 14, No. 1, pp. 58-63, 2006. 
[9] Z. Hou, and V. dubey, " BER Performance 
for Downlink MC-CDMA Systems over 
Rician Fading Channels", EURASIP 
Journal on Applier Signal Processing, pp. 
709-717, 2005. 
[10] N. Yee, J. Linnartz, and G. Fettweis," 
MULTICARRER CDMA IN INDOOR 
WIRELESS RADIO NETWORKS", The 
Fourth International Symposium on 
Personal Indoor and Mobile Radio 
Communications, PACIFICO 
YOKOHAMA, JAPAN, September. 1993. 
[11] A. Perotti, P. Rrmlein, and S. Benedetto, " 
Adaptive Coded Continuous-Phase 
Modulations for Frequency-Division 
Multiuser Systems", ADVANCES IN 
ELECTRONICS AND 
TELECOMMUNICATIONS, VOL. 1, NO. 
1, APRIL 2010 
[12] S. Chang, "An efficient compensation of 
TWTA's nonlinear distortion in wideband 
OFDM systems", IEICE Electronics 
Express, Vol. 6, No. 2, pp. 111-116, 2009. 
[13] T.Tan," POWER AMPLIFIER 

MODELING AND POWER AMPLIFIER 
PREDISTORTION IN OFDM SYSTEM", 
Journal of Science & Technology 
Development, Vol 11, No.02 - 2008 



70 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



(IJCSIS) International Journal of Computer Science and Information Security, 
Vol. 8, No. 8, November 2010 



Channel Estimation Algorithms, Complexities 
and LTE Implementation Challenges 

Md. Masud Rana 

Department of Electronics and Communication Engineering 

Khulna University of Engineering and Technology 

Khunla, Bangladesh 



Abstract — The main purposes of the long term evolution 
(LTE) are substantially improved enduser throughputs, 
low latency, reduced user equipment (UE) complexity, 
high data rate, and significantly improved user experience 
with full mobility. LTE uses single carrier-frequency 
division multiple access (SC-FDMA) for uplink 
transmission and orthogonal frequency division multiple 
access (OFDM A) for downlink transmission. The major 
challenges for LTE terminal implementation are efficient 
channel estimation (CE) method as well as equalization. 
This paper discusses the basic CE techniques and future 
direction for research in CE fields. Simulation results 
demonstraters that the linear mean square error (LMMSE) 
CE method outperforms the least square (LS) CE method 
in term of mean square error (MSE) by more than around 
3dB. Hence, based on a given LTE systems resources and 
specifications, a appropriate method among the presented 
methods can be applied for OFDMA systems. 

Keywords— LS, LMMSE, LTE, OFDMA. 



I. INTRODUCTION 

The wireless evolution has been stimulated by an 
explosive growing demand for a wide variety of high 
quality of services in voice, video, and data. This 
rigorous demand has made an impact on current and 
future wireless applications, such as digital audio/video 
broadcasting, wireless local area networks (WLANs), 
worldwide interoperability for microwave access 
(WiMAX), wireless fidelity (WiFi), cognitive radio, and 
3rd generation partnership project (3 GPP) long term 
evolution (LTE) [1], [2]. LTE uses single carrier- 
frequency division multiple access (SC-FDMA) for 
uplink transmission and orthogonal frequency division 
multiple access (OFDMA) for downlink transmission 
[3], [4]. SC-FDMA utilizes single carrier modulation 
and frequency domain equalization, and has similar 
performance and essentially the same overall 
complexity as those of OFDMA system. These 
advanced applications in which the transmitted signal 
disperses over the time and the frequency domains, 
show the need for highlydeveloped signal processing 
algorithms. In particular, one of the main challenges in 
the mobile communication is a wireless channel that 
suffers from numerous physical impairments due to 
multipath propagation, interference from other users or 
layers, and the time selectivity of a channel [5-9]. 



Many CE techniques have already been proposed for 
the LTE OFDMA systems. The simple least square (LS) 
algorithm, which is independent of the channel model, 
is commonly used in CE [10-14]. But the radio channel 
is time-variant; hence a method has to be found in order 
to perform estimation in a time-varying channel. The 
minimum mean- squared error (MMSE) estimate has 
been shown to be better than the LS estimate for CE in 
wireless communication systems [15]. The important 
problem of the MMSE estimate is its high 
computational complexity, which grows exponentially 
with inspection samples [16]. In [17], a low rank 
approximation is applied to a linear MMSE (LMMSE) 
estimator that employs the correlations of the channel. 
To further improve the system performance, Wiener 
estimation has been investigated [18]. Although it 
exhibits the best performance among the existing linear 
algorithms, it requires accurate knowledge of second 
order channel statistics, which is not always feasible at a 
mobile receiver. Also, this scheme requires higher 
complexity. 

This paper outlines the developments of the LTE 
OFDMA systems, and highlights some upcoming 
challenges, where advanced signal processing could 
play a important role in resolving them. Specifically, we 
investigates various types of CE techniques such as LS, 
and LMMSE CE methods and find out which is the 
more efficient one. The performance is measured in 
terms computational complexity, and mean square error 
(MSE). Simulation results shows that the LMMSE CE 
algorithm outperforms the existing LS CE in term of 
MSE by more than around 3dB. Hence, based on a 
given LTE systems resources and specifications, a 
appropriate method among the presented methods can 
be applied. 

The rest of the paper is organized as follows. We give 
a brief overview of the wireless communication systems 
in section II. The classification of CE is described in 
section III. The LS and LMMSE CE methods are 
describes in section IV and its performance are analyzed 
in section V. In section VI, we highlight the challenges 
for LTE terminal implementation. Finally, some 
conclusions are made in section VII. 

The following notations are used in this paper: bold 
face lower and upper case letters are used to represent 

vectors and matrices respectively. Superscripts X 
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X denote the transpose and congugate transpose of the 
X , and I is the identity matrix. 

II. COMMUNICATION SYSTEMS 

Nowadays, cellular mobile phones have become an 
important tool and part of daily life. In the last decade, 
cellular systems have experienced fast development and 
there are currently about two billion users over the 
world [6]. Mobile penetration is based on population, 
pay TV and broadband is households. 




2004 2005 2006 2007 2008 2009 2010 2011 

User 

Fig. 1 Mobile is the key growth platform. 

The idea of cellular mobile communications is to 
divide large zones into small cells, and it can provide 
radio coverage over a wider area than the area of one 
cell. This concept was developed by researchers at AT 
& T Bell laboratories during the 1950s and 1960s. The 
initial cellular system was created by Nippon telephone 
& telegraph (NTT) in Japan, 1979. From then on, the 
cellular mobile communication has evolved. 

The mobile communication systems are frequently 
classified as different generations depending of the 
service offered. The first generation (1G) comprises the 
analog communication techniques, and it was mainly 
built on frequency modulation (FM) and frequency 
division multiple accesses (FDMA). Digital 
communication techniques appeared in the second 
generation (2G) systems, and main access schemes are 
time division multiple access (TDMA) and code 
division multiple access (CDMA). The two most 
commonly accepted 2G systems are global system for 
mobile (GSM) and interim standard-95 (IS-95). These 
systems mostly offer speech communication, but also 
data communication limited to rather low transmission 
rates [7]. The concept of the third generation (3G) 
system started operations on October, 2002 in Japan. 



The 3 GPP members started a feasibility study on the 
enhancement of the universal terrestrial radio access 
(UTRA) in December 2004, to improve the mobile 
phone standard to cope with future requirements. This 
project was called LTE [5], [8]. 3GPP LTE uses SC- 
FDMA for uplink transmission and OFDMA for 
downlink transmission [9]. Fig. 2 summarizes the 
cellular mobile communication systems and its access 
schemes [10]. 
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Fig.2 (a): Evolution path in mobile communication 
systems. 
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Fig. 2 (b) Multiple access schemes. 

From the beginning wireless communications there is 
a high demand for realistic mobile fading channels. The 
motive for this significance is that efficient channel 
models are necessary for the investigation, design, and 
deployment of wireless communication system for 
reliable transfer of information between two parties. 
Correct channel models are also important for testing, 
parameter optimization, and performance evolution of 
wireless communication systems. The performance and 
complexity of signal processing algorithms, transceiver 
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designs, and smart antennas, employed in future 
wireless communication systems, are highly dependent 
on design methods used to model mobile fading 
channels. The effect of the channel on the transmitted 
information must be estimated in order to recover the 
transmitted information correctly [11]. 



transmitted at certain positions of the OFDMA 
frequency time pattern, in its place of data as shown in 
Fig. 4. An amount of training sequences is raise the 
accuracy of CE, but it is reduces the system efficiency, 
because there isn't any new information in the training 
symbols. 



III. CLASSIFICATION OF CE 

Channel can be described everything from the 
source to the destination of a radio signal. This 
includes the physical medium between the transmitter 
and the receiver through which the radio signal 
propagates. On the other hand, CE is the process of 
characterizing the effect of the physical channel on the 
input sequence. It can be employed for the purpose of 
detecting received signal, improve signal to noise ratio 
(SNR), channel equalization, reduced ISI, mobile 
localization, and improved system performance [8], [9]. 
In general, both iterative and noniterative CE 
techniques can be divided into three categories such as 
the training CE, blind CE, and semi-blind CE [10], 
[21]. 





CE 






1 






Iterative and non-iterative CE 






1 






1 






' 






1 




Preamble based training CE 




1 




' 


' 


Data aided 
CE 






Blind CE 




Semi-blind CE 




















Pilot based training CE 













« A 
















Q3 
JO 


o 


o 


o 


o 


o 


o 


o 


o 


CO 


o 


o 


o 


o 


o 


o 


o 


o 




o 


o 


o 


o 


o 


o 


o 


o 




o 


o 


o 


o 


o 


o 


o 


o 




o 


o 


o 


o 


o 


o 


o 


o 




o 


o 


o 


o 


o 


o 


o 


o 




o 


o 


o 


o 


o 


o 


o 


o 




o 


• 


o 


o 


o 


o 


• 


o 



CD Data symbol 



Pilot symbol 



Fig. 3 Outline of the CE. 



Fig. 4 Positions of data and pilot symbols. 



B. Blind CE 

A blind CE method requires no training sequences 
[13]. They exploit certain underlying mathematical 
information regarding the type of data being transmitted. 
These CE methods might be bandwidth efficient but 
still have their own downfalls. These methods are 
enormously computationally intensive and convergence 
is slow [21]. A popular category of blind CE method is 
decision directed algorithms. These methods rely upon 
the demodulated and detected signal at the receiver to 
reform the transmitted signal. The drawback of these 
CE algorithm is that a bit error at the receiver is cause 
the construction of an erroneous transmitted sequence. 

C. Semi-blind CE 



A. Taining CE 

The training CE algorithm requires probe sequences; 
the receiver can use this probe sequence to reconstruct 
the transmitted waveform [10]. It has the advantage of 
being used in any radio communications system quite 
easily. Even if this is the most popular CE method, it 
still has its drawbacks. The drawback of training 
sequence methods is that the probe sequence occupies 
valuable bandwidth, reducing the throughput of the 
communication system. This scheme also suffers due to 
the fact that most communication systems send 
information lumped frames. It is only after the receipt of 
the entire frame that the channel estimate can be 
reconstructed from the embedded probe sequence. Since, 
the coherence time of the channel might be smaller than 
the frame time, for rapid fading channels this CE might 
not be sufficient. Training symbols can be placed either 
at the beginning of each burst as a preamble or regularly 
through the burst [21]. Training sequences are 



Semi-blind CE methods are used a combination of 
data aided and blind methods [11]. Since, there are a 
large number of channel coefficients, a large number of 
pilot symbols may be required. It would result in a 
decrease of data throughput. To avoid it, the semi-blind 
CE methods with fewer pilot symbols can be used. As a 
result, improve system performance in compared with 
using equal pilots in LS method. Moreover, there is a 
trend to use superimposition of pilot and data symbols. 
In fact, these methods by superimposing pilot and data 
symbols in the same time economize the system 
bandwidth. But in superimposed training sequence 
scheme, there is disadvantage due to the interference of 
information data. So, an accurate CE has been one of 
the most important issues for reliable mobile 
communication systems. So, CE can be performed by 
many ways inserting pilot tones into each OFDMA 
symbol with a specific period or blind CE. 
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IV. LS and LMMSE CE ALGORITHMS 

Pilot estimators are often achieved by multiplexing 
training sequence into the data sequence. These pilot 
symbols allow the receiver to extract channel 
attenuations and phase rotation estimates for each 
received symbol, facilitating the compensation of 
channel fading envelope and phase. A general CE 
procedure for communication system is shown in Fig. 5. 




Any CE algorithm 



Fig. 5 General CE procedure. 

The signal S is transmitted via a unknown time- 
varying channel w, and corrupted by an additive white 
Gaussian noise (AWGN) z, before being detected in a 

receiver. The channel coefficient W est , is estimated 

using any kind of CE method. In the channel estimator, 
transmitted signal S is convolved with an estimate of 

the channel W est . The error between the received 

signal and its estimate is 

e = r-r gB (1) 

The aim of most CE algorithms is to minimize the MSE, 
while utilizing as little computational resources as 
possible in the estimation process. 

The idea behind LS CE method is to fit a model to 
measurements in such a way that weighted errors 
between the estimation and the true model are 
minimized [14]. The received signal can be written as 
vector notation as 

r = Sw + z, (2) 

where r = [r p r 2 r L ] is the received signal, 

S = diag[s p S 2 S L ] is the transmitted signal, 

W = [w p W 2 W L ] is the unknown channel 

coefficients, and z = [z v Z 2 Z L ] is additive white 

Gaussian noise (AWGN). The LS estimate of such a 
system is obtained by minimizing square distance 
between the received signal and its estimate as [14] 

j = (r-Sw)(r-Sw) t . (3) 

Now differentiate this with respect to W and set the 
results equal to zero to produce [14]: 



w LS = (pi + SSyVr, (4) 

where p is regularization parameter and has to be 
chosen such that the resulting eigenvalues are all 

defined and the matrix (pi + SS)~ is least perturbed. 
Here the channel is considered as a deterministic 
parameter and no knowledge on its noise statistics is 
needed. The LS estimator is computationally simple but 
problem is that the inversion of the square matrix turns 
out to be ill-conditioned (sometime). So, it will need to 
regularize the eigenvalues of the matrix to be inverted 
by adding a small constant term to the diagonal [14]. 

MMSE CE method proposes at the minimization of 
the MSE between the actual and estimated channel 
impulse response (CIR). The most important problem of 
the MMSE estimate is its high computational 
complexity, which grows exponentially with inspection 
samples [15], [16]. In [17], a low rank approximation is 
applied to a linear MMSE (LMMSE) estimator that 
employs the correlations of the channel. The general 
expression of LMMSE is described as 



w^ 



= R(R + n/SNR)-V 



LS' 



(5) 
where R ww is the auto-covariance matrix of w, W LS is 

the channel response in LS estimation, and r is a constant 
depending on the modulation constellation 



r = E[|S k | 2 ]E[|l/S k | 2 ]. 



(6) 



For QPSK modulation, r is 1[17]. Here, W LS is not 
very important issue in the matrix computation, the 
inversion of R ww does not require to be estimated every 

time the transmitted sybmols in W LS varies. Also, if 

signal to noise ratio (SNR) and R ww are identified 

earlier or are set to fixed nominal values, the matrix 

(R ww + ri/SNR)" needs to be computed at once. 

Under these situation, the estimation requires L 
multiplications per tone. 

In order to calculate computational complexity, we 
assume that the evaluation of the scalar addition or 
subtraction needs L addition and multiplying the scalar 
by the vector requires L multiplications, and multiplying 
two matrix need 4L multiplications and 4L-1 additions. 
Table I summarizes the computational complexity of the 
different CE methods. 



Table I Com] 


plexity of the CE methods 


Operation 


LSCE 


LMMSE CE 


Matrix inversion 


1 


2 


Multiplication 


11L 


17L 


Addition 


11L-3 


17L-5 
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We calculate the number of complex addition and 
multiplications which are needed to implement the 
algorithm. It shows that the LS CE algorithm has lower 
complexity than LMMSE method. For this LMMSE 
estimator, the main contribution to the complexity 

comes from the term R ww (R ww + n/SNR)" 1 . 



V. ANALYTICAL RESULT 

In this simulations, we consider a system operating 
with a bandwidth of 1.25MHz, with a total symbol 
period of 520jis, of which 10 jis is a cyclic prefix. The 
entire channel bandwidth is divided into 128 sub- 
carriers, implemented by 128-point IDFT. Sampling is 
performed with a 1.92MHz. The data symbol is based 
on BPSK. In practice, the ideal channel coefficient is 
unavailable, so estimated channel coefficient must be 
used instead. The more accurate estimated channel 
coefficient is, the better MSE performance of the CE 
will achieve. The performance is measured using MSE 
between the actual and the estimated channel response. 
Fig. 6 shows the MSE versus SNR for the different 
channel estimators. We can see that LMMSE CE can 
always achieve better performance than LS CE. The 
main reason is, LMMSE CE method uses channel 
correlation as well as SNR but the LS CE method does 
not uses channel correlation. Finally, it concludes that 
the LMMSE CE method has higher computational 
complexity and around 3dB better performance 
compared with the LS CE method. 




Fig. 6 MSE of the LS and LMMSE CE methods. 



VI. IMPLEMENTATION CHALLENGES 

LTE meets the important obligations of next 
generation mobile communications, but still falls short 



on some preferred requirements such as cell-edge 
spectral efficiency in the uplink transmission [19]. LTE 
implementation poses the following signal processing 
challenges in terms of performance, cost and power 
consumption: 

• Regrettably, the development of data rates is not 
matched by advanced in semiconductor structures, 
terminal power consumption improvements. 
Therefore, advanced signal processing architectures 
as well as algorithms are needed to cope with these 
data rates [19]. 

• High performance multiple input multiput output 
(MIMO) receivers such as sphere decoders, 
maximum likelihood receivers offer substantial 
system performance gains but enforce an 
implementation challenge, especially when the high 
peak data rates are targeted [19]. 

• LTE utilizes precoding, which requires accurate CE. 
Advanced methods like iterative decision directed 
CE and pilot based CE offer system performance 
improvements, but pose again a computational 
complexity challenge [19]. 

• LTE has a large "toolkit" of MIMO methods and 
adaptive methods. The choice and combination of 
the accurate technique in a cell with heterogeneous 
devices, channel conditions and bursty data 
services is a challenge [19]. 

• It is very difficult to implement many antennas in a 
small hand portable unit. In near future, we need to 
use wearable antenna on head. 

• LTE roll-out will be gradual in most cases- 
interworking with other standards such as GSM or 
HSPA is required for a long time. This imposes not 
only a cost and computational complexity issue. 
One of the reasons many early 3G terminals had 
poor power consumption was the need for second 
generation (2G) cell search and handover in 
addition to normal 3G operation. Reduced talk- time 
for dual- mode devices is not suitable [19]. Fig. 7 
shows the estimated complexity based on the 
baseline receiver. Note that the complexity of the 
LTE receiver grows linearly with respect to system 
bandwidth and the corresponding maximum 
nominal throughput. Interestingly, MIMO mode 
requires less than double the SIMO mode 
complexity. 
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Fig. 7 Complexity of LTE receiver. 



VII. CONCLUSION 

An accurate CE is one of the most important issues for 
reliable future wireless communication systems such as 
LTE. In this paper, we briefly insvesteget LS and 
LMMSE CE techniques for LTE terminal implemtation. 
Simulations demonstrated that the MSE performance of 
the LMMSE CE algorithm is at least 3dB better than 
existing LS estimator. Even though, the LMMSE CE 
technique requires a little high computational complexity, 
the advantage in the MSE and convergence towards true 
channel coefficient may be significantly useful for future 
mobile communications which allow broadband 
multimedia Internet access and wireless connection 
anywhere, and any time. This paper also discusses the 
challenges imposed by developments in the LTE terminal 
implementation. Hence, based on a given LTE systems 
resources and specifications, a appropriate method 
among the presented methods can be applied. 
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Abstract 

This paper presents application of wavelet and Radial 
Basis Function (RBF) for power quality disturbance 
classification. Features are extracted from the electrical 
signals by using db wavelets. The features obtained 
from the wavelet are unique to each type of electrical 
fault. These features are normalized and given to the 
RBF. The data required are generated by simulating 
various faults in the test system. The performance of 
the proposed method is compared with the existing 
feature extraction techniques. Simulation results show 
the effectiveness of the proposed method for power 
quality disturbance classification. 

Keywords wavelets, Radial basis function (RBF), 
Harmonics, Power quality 

1. Introduction 

Electrical fault arises due to sudden loads, failure in 
the electrical circuits, lightning, conducted or 
radiated low and high frequency phenomena. Due to 
electrical fault in the system, the power quality 
deteriorates which is an indication of slow failure of 
the equipment. Many algorithms have been 
developed by previous researchers for classification 
of electrical disturbances. Integrated Fourier linear 
combiner and fuzzy expert system [1] were used for 
the classification of transient disturbance waveforms 
in a power system. S-Transform and two dimensional 
time-time (TT) transform [2] have been 
implemented for electrical fault identification. 
Patterns generated by S-Transform and TT transform 
are unique and hence accuracy of identification is 
high. An adaptive neural network approach for the 
estimation of harmonic distortions and power quality 
in power networks are implemented [3]. A hybrid 
system to automatically detect, locate and classify 
disturbances affecting power quality in an electrical 
power system is presented [4]. Least absolute value 
(LAV) State estimation algorithm has been used to 
measure the flicker voltage magnitude [5]. The 
Simulated Annealing (SA) optimization algorithm 
has been used for measuring the voltage flicker 
magnitude, frequency and the harmonics contents of 



the voltage signal for power quality analysis [6]. An 
algorithm to detect the fundamental frequency is 
proposed. It is based on the chirp-z transform (CZT) 
spectral analysis and is able to 

observe all standards in force because of its accuracy 
and working characteristics [7]. 

A wavelet-based neural classifier integrating the 
DWT [8] [9], learning vector quantization (LVQ) 
neural network, and decision-making scheme to 
become an actual power disturbance classifier. The 
classifier employed the DWT coefficients as inputs 
to multiple LVQ neural networks to train and 
perform waveform recognition, and use the decision- 
making schemes to classify the transient disturbance 
type. A novel classifier using a rule-based method 
and a wavelet packet-based hidden Markov model 
(HMM) [10]. The rule-based method is employed to 
classify the time-characterized feature disturbances, 
while the wavelet packet-based HMM is utilized to 
categorize the frequency-characterized feature power 
disturbances. Wavelet-multi-resolution 

decomposition that combines frequency-domain with 
time-domain analysis for power disturbance feature 
extraction is proposed [11]. Extracting the features 
from the wavelet transform coefficients at different 
scales as inputs to neural networks for classifying 
the nonstationary signal type have been proposed 
[12]. A wavelet norm entropy-based effective feature 
extraction method for power quality disturbance 
classification problem has been done by [9]. 
Extracting the squared wavelet transform coefficients 
(SWTC) at each scale as inputs to the neural 
networks for classifying the electrical disturbance is 
proposed [13] [14] [15]. Multi-wavelet-based neural 
networks with learning vector quantization network 
are used for power quality disturbances as a powerful 
classifier [16]. The DWT coefficients as inputs to a 
single-layer self-organizing map neural network to 
train and classify the transient disturbance has been 
used [17]. Fuzzy ARTMAP, Back propagation 
algorithm and Radial Basis Function (RBF) network 
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in combination with S Transform, Wavelet transform 
and Hilbert Transform (HT) for classifying power 
faults have been used [18]. DWT coefficients have 
been used as inputs to a refined neuro-fuzzy network 
to train and classify the power system disturbance 
[19]. Continuous wavelet transform (CWT) has been 
used to estimate the disturbance time duration and 
the DWT to estimate the disturbance amplitude [20]. 
The two features thus obtained are then used to 
classify the transient disturbance type. The authors 
have claimed HT with RBF gives more fault 
classification from a set of 6 different types of faults 
with 3000 signals generated using Matlab. 

Algorithms should be developed that can classify the 
type of harmonics and other electrical disturbances. 
Due to generation of non- stationary random 
disturbances, many existing classification algorithms 
are provided with intelligence using artificial neural 
networks, fuzzy logic and evolutionary algorithms. 
The artificial neural networks (ANNs) can classify 
the electrical disturbances in the presence of noise in 
the signal. In this work, RBF has been implemented 
for electrical fault classification. 

2. Proposed Methodology 

This research work proposes wavelets for feature 
extraction and RBF for classification of electrical 
fault. In order to achieve maximum classification, 
proper data input, optimum topology and correct 
training of RBF with suitable parameters is a must. A 
large amount of patterns are generated from different 
fault conditions. Twelve features are generated for 
each pattern. The faults considered are instantaneous 
interruption, instantaneous sag, instantaneous swell, 
momentary interruption, momentary sag, momentary 
swell, temporary interruption, temporary sag, 
temporary swell and harmonics. The features 
obtained are mean, standard deviation, norm, 
maximum and minimum of signal for Approximation 
and Details at the 5 th level decomposition. These 
features are given as inputs for the RBF neural 
network along with labeling that indicates a fault. 
Training the RBF gives final weights. The final 
weights are used to identify a fault during testing. 
Fig. 1 explains the overall sequence of proposed 
methodology. 

Steps in feature extraction from the input signal 

1. Input voltage wave form is sampled, f s = 12800 Hz. 

2. Decomposition of the signal by wavelets till 5 th 
level. 

3. Ten features are obtained from Approximation 
and Detail in the 5 th level 



4. Input the extracted features into the input layer of 
features. 

5. Train the RBF network and store the final 
weights. 

6. Test the RBF network with new signal having a 
fault. 



3. Wavelet decomposition 

Wavelet is a process of analyzing the signal 
with scaling and shifting to obtain the details of the 
signal. It decomposes the signal into high frequencies 
(Detail) and low frequencies (Approximation). The 
first time getting Approximation and Detail is called 
level 1 decomposition (Figure 1). Subsequent 
decompositions are done, by using the 
Approximation or Detail obtained in level 1, to 
further get the required information. In every level of 
decomposition, the number of samples in the signal 
is reduced to 50% of the total samples to 
Approximation and remaining 50% of the samples to 
Detail. During subsequent decomposition, only 50% 
of the samples (Approximation will be further used 
for getting Approximation and Detail in the 
subsequent level decomposition). 



Electrical fault 



Approximation 
(Al) 6400 



Details (Al) 
6400 



Approximation 
(A2) 3200 



Details(Al) 
3200 



Level 1 



Level 2 



Approximati 
on (A2) 400 



Details(Al) 
400 



Level 5 



Figure 1 Levels of decomposition 
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Short duration variations-Instantaneous Swell with harmonics 




0.3 0.4 0.5 0.6 

Time (sec) 



Figure 2 Short duration variations- 
Instantaneous swell with harmonics 



The Approximation and Details for different 

conditions of the signals are given in Figure 3 - 
Figure 5. 

Approximation Al Detail Dl 




100 200 300 400 




100 200 300 400 



A fault signal has been shown in Fig.2. This 
signal has been generated by combining pure 
sinusoidal wave form with harmonics and with swell 
as per the data available in Table- 1. 



Figure 3 Fifth level decomposition of 
Instantaneous swell 



Table 1 Categories of Power System Faults 


Category 


Duration 


Voltage 


Harmonics 


Present in the 
entire signal 


0.0035- 
0.087 pu 


Short duration variation- 
Instantaneous 
Interruption 


0.5 -30 cycles 


O.lpu 


Short duration variation- 
Instantaneous Sag(dip) 


0.5 -30 cycles 


0.1 - 0.9 
pu 


Short duration variation- 
Instantaneous Swell 


0.5 -30 cycles 


1.1 -1.8 
pu 


Short duration variation- 
Momentary Interruption 


30 cycles -3s 


O.lpu 


Short duration variation- 
Momentary Sag(dip) 


30 cycles -3s 


0.1 - 0.9 
pu 


Short duration variation- 
Momentary Swell 


30 cycles -3s 


1.1 -1.8 
pu 


Short duration variation- 
Temporary Interruption 


3s- lmin 


O.lpu 


Short duration variation- 
Temporary Sag(dip) 


3s- lmin 


0.1 - 0.9 
pu 


Short duration variation- 
Temporary Swell 


3s- lmin 


1.1 -1.8 
pu 



Approximation Al 



Detail Dl 




100 200 300 400 




100 200 300 400 



Figure 4 Fifth level decomposition of 
Harmonicssignal 



Approximation Al 





100 200 300 400 100 200 300 400 
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Figure 5 Fifth level decomposition of 

Instantaneous_interruption_pointl_3cycles_no_h 

armonics 

It can be noted from the Figures 3-5 that the wavelet 
decomposition really helps in identifying the 
presence of disturbances. In order to make it more 
clear features are extracted from the approximation 
and details. 

The features are obtained from the Approximation 
and Details of the 5 th level by using the following 
equations 



V1 =^ (IF) 



(i) 



Where d = Samples in a frame and VI 
value of Instantaneous Frequency 



Mean 



V2 



^5>- 



VI) 



(2) 



Where V2=Standard Deviation of Instantaneous 
Frequency 

Where V2=Standard Deviation of Instantaneous 
Frequency 
V3 = Maximum(IF) 



V4 = Minimum(IF) 
V5 = norm(IF^ 

Where V5 = Energy value of frequency 



(3) 
(4) 

(5) 



4. Radial basis function 

Radial basis function is a supervised neural network. 
The network has an input layer, hidden layer (RBF 
layer) and output layer. The 10 features obtained are 
used as inputs for the network and the target values 
for training each fault is based on the values given in 
Table 2. 

Training RBF is done as follows, 

1. Distance between pattern and centers are 
found. 

2. An RBF matrix whose size will be (np X 
cp). , where np= number of pattern (50 
signals in each fault X number of faults) 
used for training and cp is number of 
centers which is equal to 10. 

3 . Final weights are calculated. 

4. During testing the performance of the RBF 
network, RBF values are formed from the 
features obtained from a signal and 
processed with the final weights obtained 
during training. Based on the result 



obtained, the electrical fault signal is 
classified. 
5. Experiment and Results 

Matlab 2009 has been used to generate 300 signal 
patterns for each electrical fault. The equation used 
for generating an electrical fault signal is as follows: 

y = A sin(2 n f t + (p) (6) 

Where f = Frequency of the signal, A = Amplitude of 
the signal and cp= Phase angle. 

Only one wave form is created when considering 
pure signal,. Different harmonics with different 
amplitudes are generated and summed. The 
harmonics waveform is added to fault signals. 
Harmonics signals are generated with f h (1 st to 25 th 
harmonics) and voltage of the harmonics, v h (0.004 
pu to 0.09pu). Similarly for each fault, swell, sag, 
interruption under instantaneous, momentary and 
transient, 300 samples are created with varied time 
duration, varying amplitudes with 50 Hz as constant 
frequency given in Table 2. Hence, a total of (10 
faults * 300) + 1 pure signals are generated using 
equation (6). At random, 20 signals are considered 
from each fault for training RBF. They are given in 
Table 2. 



Table 2 Faults Labeling 




Number 
of 

patterns 
used for 
testing 


Number of 
patterns 
used for 
training 


Targe 
t 

label 
ing 


Pure sine wave 


1 


1 


0.01 


Instantaneous 
Interruption 


250 


50 


0.02 


Momentary 
Interruption 


250 


50 


0.03 


Temporary 
Interruption 


250 


50 


0.04 


Temporary sag 


250 


50 


0.05 


Instantaneous sag 


250 


50 


0.06 


Momentary sag 


250 


50 


0.07 


Temporary swell 


250 


50 


0.08 


Momentary swell 


250 


50 


0.09 


Instantaneous swell 


250 


50 


0.1 


Harmonics 


250 


50 


0.11 



A counter is used to make a note in how many 
frames have the same values of frequency and the 
voltage occurs continuously If duration of existence 
of fault is in multiples of 8, 16, 24, 32, 40, 48 cycles 
or more than one second then. Based upon the 
number of similar values in the successive frames, a 
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fault is classified. In a signal of 1 sec, minimum 2 
faults are introduced along with harmonics. 



6. Conclusion: 

In this work, wavelet (dbl) has been used to obtain 
Approximation and Details at the 5 th level 
decomposition. As the initial sample size is 12800 in 
the sampled signal, in the 5 th level decomposition, 
400 samples for Approximation and 400 samples for 
Details are obtained. The features mean, std, norm, 
minimum and maximum of the Approximation and 
Details are obtained. The features are used as 
training and testing data for the RBF network. The 
percentage of electrical fault identification in given 
in Table 3 



Table 3 Electrical fault identification 


Faults 


Number 

of 
patterns 
identified 


Number 

of 
patterns 
used for 
testing 


% 
identification 

and 
classification 


Pure sine wave 


1 


1 


100 


Instantaneous 
Interruption 


235 


250 


94 


Momentary 
Interruption 


238 


250 


95.2 


Temporary 
Interruption 


247 


250 


98.8 


Temporary sag 


241 


250 


96.4 


Instantaneous 
sag 


238 


250 


95.2 


Momentary sag 


247 


250 


98.8 


Temporary swell 


238 


250 


95.2 


Momentary swell 


247 


250 


98.8 


Instantaneous 
swell 


247 


250 


98.8 


Harmonics 


241 


250 


96.4 
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Abstract- Genome Analysis of a human being permits useful 
insight into the ancestry of that person and also facilitates the 
determination of weaknesses and susceptibilities of that person 
towards inherited diseases. The amount of accumulated 
genome data is increasing at a tremendous rate with the rapid 
development of genome sequencing technologies and gene 
prediction is one of the most challenging tasks in genome 
analysis. Many tools have been developed for gene prediction 
which still remains as an active research area. Gene prediction 
involves the analysis of the entire genomic data that is 
accumulated in the database and hence scrutinizing the 
predicted genes takes too much of time. However, the 
computational time can be reduced and the process can be 
made more effective through the selection of dominant genes. 
In this paper, a novel method is presented to predict the 
dominant genes of ALL/AML cancer. First, to train an FF- 
ANN a combinational data of the input dataset is generated 
and its dimensionality is reduced through Probability Principal 
Component Analysis (PPCA). Then, the classified database of 
ALL/AML cancer is given as the training dataset to design the 
FF-ANN. After the FF-ANN is designed, the genetic algorithm 
is applied on the test input sequence and the fitness function is 
computed using the designed FF-ANN. After that, the genetic 
operations crossover, mutation and selection are carried out. 
Finally, through analysis, the optimal dominant genes are 
predicted. 

Keywords- gene prediction, Microarray gene expression 
data, Probabilistic PC A (PPCA), dimensionality reduction, 
Artificial Neural Network (ANN), Back propagation (BP), 
dominant gene, genetic algorithm. 



I. INTRODUCTION 

In the public domain huge quantity of genomic and 
proteomic data are accessible. The capability to process this 
information in ways that are helpful to humankind is 
becoming more and more significant [1].A fundamental 



step in the understanding of a genome is the computational 
recognition, and in the analysis of newly sequenced 
genomes it is one of the challenges. Accurate and speedy 
tools are essential for the analysis of genomic sequences and 
for interpreting genes [2]. In such circumstances, 
conventional and modern signal processing techniques plays 
a vital part in these fields [1]. Genomic signal processing 
[11] (GSP) is a comparatively novel area in bio-informatics. 
It deals with the utilization of traditional digital signal 
processing (DSP) techniques in the representation and 
analysis of genomic data. 

The code for the chemical composition of a 
particular protein is enclosed in the DNA which is a 
segment of gene. Genes functions as the pattern for proteins 
and some extra products, and the main intermediary that 
translates gene information in the production of genetically 
encoded molecules is mRNA [4]. Usually sequences of 
nucleotide symbols, symbolic codons (triplets of 
nucleotides), or symbolic sequences of amino acids in the 
corresponding polypeptide chains present in the strands of 
DNA molecules represent the genomic information. [2]. 
Gene expression microchip, which is perhaps the most 
rapidly expanding tool of genome analysis enables 
simultaneous monitoring of the expression levels of tens of 
thousands of genes under diverse experimental conditions. 
An influential tool in the study of collective gene reaction to 
changes in their environments is presented by gene 
expression microchip, and it also offers indications about 
the structures of the involved gene networks [3]. 

Nowadays, in a solitary experiment by employing 
microarrays the expression levels of thousands of genes, 
possibly all genes in an organism can be measured 
simultaneously [4]. In monitoring genome- wide expression 
levels of gene microarray technology has become a requisite 
tool [5]. The evaluation of the gene expression profiles in a 
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variety of organs which employs microarray technologies 
disclose separate genes, gene ensembles, and the metabolic 
ways underlying the structural and functional organization 
of an organ and its physiological function [6]. By the 
employment of microarray technology the diagnostic chore 
can be automated and the precision of the conventional 
diagnostic techniques can be enhanced. Simultaneous 
examination of thousands of gene expressions is being 
facilitated by microarray technology [7]. 

Efficient representation of cell characterization at 
the molecular level is possible with microarray technology 
which simultaneously measures the expression levels of tens 
of thousands of genes [8]. Gene expression analysis [10] 
[12] that utilizes microarray technology has a broad variety 
of latent for discovering the biology of cells and organisms 
[9]. Accurate prediction and diagnosis of diseases is been 
assist by the microarray technology. For envisaging the 
entire gene structure, mainly the precise exon-intron 
structure of a gene in a eukaryotic genomic DNA sequence 
gene identification is employed. After sequencing, finding 
the genes is one of the first and most significant steps in 
knowing the genome of a species [13]. A field of 
computational biology which is involved with 
algorithmically distinguishing the stretches of sequence, 
generally genomicDNA that are biologically functional is 
known as gene finding. This in particular not only engrosses 
protein-coding genes but also includes added functional 
elements for instance RNA genes and regulatory regions 
[14]. Some of the researches on the gene prediction are [15], 
[16], [17] and [18]. 

In this paper, we propose an effective gene 
prediction technique which predicts the dominant genes. 
Initially, the classified microarray gene dataset (either Acute 
Myeloid Leukemia (AML) or Acute Lymphoblastic 
Leukemia (ALL)) which is of high dimension is reduced 
through the Probability Principal Component Analysis 
(PPCA) to generate the training dataset for the neural 
network. Consequently, through the training data the Feed 
Forward- ANN is designed and then the genetic algorithm is 
utilized to predict the dominant genes of ALL/AML cancer. 
Subsequently the gene which causes either AML or ALL is 
predicted devoid of analyzing the entire database. The rest 
of the paper is organized as follows. Section 2 details the 
genetic algorithm and in Section 3, a brief review of some of 
the existing works in gene prediction is presented. The 
proposed effective gene prediction is detailed in Section 4. 
Section 5 describes the results and discussion. The 
conclusions are summed up in Section 6. 



II. 



GENETIC ALGORITHM 



they are multi-point search methods. Moreover, GA's are 
applicable to distinct problem in the search space. Hence, 
GA is not only very simple to use but also a very powerful 
optimization tool [28]. Strings are present in the search 
space of GA, each of which represents a candidate solution 
to the problem and are termed as chromosomes. Fitness 
value is the objective function value of each chromosome. A 
set of chromosomes along with their associated fitness is 
termed as population. The populations which are generated 
in an iteration of the genetic algorithm are termed as 
generations [29]. 

New generations (offspring) are generated by 
utilize crossover and mutation techniques. Two 
chromosomes are split by crossover and by taking one split 
part from each chromosome and combining those two new 
chromosomes are created. A single bit of a chromosome is 
changed by mutation. The chromosomes with the best 
fitness value calculated for a certain fitness criteria are 
retained while the other chromosomes are removed. The 
process is repeated until one chromosome has the best 
fitness value and that chromosome is selected as the solution 
for the problem [30]. 



III. 



REVIEW ON RELATED RESEARCHES 



The heredity and evolution of living organisms are 
stimulated by computer programs known as Genetic 
Algorithms [27]. By utilizing GAs an ideal solution is 
possible even for multi modal objective functions because 



A handful of recent research works available in the 
literature are briefly reviewed in this section. 

A computational technique for patient outcome prediction 
was introduced by Huiqing Liu et al. [19]. Two extreme 
types of patient samples were utilized for the training phase 
of this technique: 

1) short-term survivors who got an inopportune result in a 
small period and 

2) long-term survivors who were preserving a positive 
outcome after a long follow-up time. 

These incredible training samples generated a clear 
platform for identifying suitable genes whose expression 
was intimately related to the outcome. With the assistance of 
a support vector machine the selected extreme samples and 
the important genes were then integrated in order to 
construct a prediction model. Every validation sample is 
owed a risk score that falls into one of the special pre- 
defined risk groups by employing that prediction model. 
Several public datasets adapts this technique. In quite a few 
cases as perceived in their Kaplan-Meier curves, patients in 
high and low risk groups who are rated by the suggested 
technique have obviously clear outcome position. They have 
also established that for enhancing the prediction accuracy, 
the suggestion of deciding merely extreme patient samples 
for training is efficient when diverse gene selection 
techniques are employed. 

MiTarget which is a SVM classifier for miRNA 
target gene prediction was introduced by Kim et al. [20]. It 
employed a radial basis function kernel and was then 
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categorized by structural, thermodynamic, and position- 
based features as a similarity measure for SVM features. For 
the first time, the features were presented and the 
mechanism of miRNA binding was reproduced. When 
compared with previous tools the SVM classifier has created 
high performance with the assistance of biologically 
pertinent data set that was attained from the literature. The 
important tasks for human miR-1, miR-124a, and miR-373 
was computed by employing Gene Ontology (GO) analysis 
and the importance of pairing at positions 4, 5, and 6 in the 
5' region of a miRNA was explained from a feature 
selection experiment. A web interface for the program was 
also presented by them. 

Based on the information that a majority of exon 
sequences have a 3-base periodicity, and intron sequences 
do not have the sole characteristic, a technique to predict 
protein coding regions was developed by Changchuan Yin 
et al. [21]. By employing nucleotide distributions in the 
three codon positions of the DNA sequences this technique 
computed the 3-base periodicity and the background noise 
of the stepwise DNA segments of the target DNA 
sequences. From the trends of the ratio of the 3-base 
periodicity to the background noise in the DNA sequences 
the exon and intron sequences can be recognized. Case 
studies on genes from diverse organisms illustrated that the 
proposed technique was an efficient means for exon 
prediction 

On the basis of a two-stage machine learning 
approach a gene prediction algorithm for metagenomic 
fragments was proposed by Hoff et al. [22]. Initially, for 
extracting the features from DNA sequences, linear 
discriminants were employed for monocodon usage, 
dicodon usage and translation initiation sites. Secondly, for 
calculating the chance in such a way that the open reading 
frame encodes a protein and an artificial neural network 
combines these characteristics with open reading frame 
length and fragment GC -content. This probability was 
employed for categorizing and achieving the gene 
candidates. By means of extensive training this technique 
formed fast single fragment predictions with fine quality 
sensitivity and specificity on artificially fragmented 
genomic DNA. Additionally, with high consistency this 
technique can precisely calculate translation initiation sites 
and distinguish complete genes from incomplete genes. 
Extensive machine learning techniques were well-suited for 
predicting the genes in metagenomic DNA fragments. 
Specially, the association of linear discriminants and neural 
networks was a very promising one and are believed to be 
taken into consideration for incorporating into metagenomic 
analysis pipelines. 

Based on the physicochemical features of codons 
computed from molecular dynamics (MD) simulations an ab 
initio model for gene prediction in prokaryotic genomes was 
introduced by Poonam Singhal et al. [15]. For every codon 



the model requires a statement of three computed quantities, 
the double-helical trinucleotide base pairing energy, the base 
pair stacking energy, and a codon propensity index for 
protein-nucleic acid interactions. Fixing these three 
parameters, for each codon, eases the computation of the 
magnitude and direction of a cumulative three-dimensional 
vector for any length DNA sequence in all the six genomic 
reading frames. Analysis of 372 genomes containing 
350,000 genes has confirmed that the orientations of the 
gene and non-gene vectors were significantly apart and a 
apparent difference was made probable between genie and 
non-genic sequences at a level comparable to or superior 
than currently accessible knowledge-based models trained 
on the basis of empirical data, providing a strong evidence 
for the likelihood of a unique and valuable physicochemical 
classification of DNA sequences from codons to genomes. 

For the genus Aspergillus a program called 
NetAspGene which is a dedicated, publicly available, splice 
site prediction was developed by Kai Wang et al. [23]. The 
most widespread mould pathogen that is the gene sequences 
from Aspergillus fumigatus, were employed to build and 
test their model. Aspergillus encloses smaller introns when 
compared with several animals and plants; and hence to 
cover both the donor and acceptor site information they 
have applied a larger window size on single local networks 
for training. NetAspGene was applied to other Aspergilli, 
including Aspergillus nidulans, Aspergillus oryzae, and 
Aspergillus niger. Valuation with independent data sets 
disclosed that NetAspGene executed significantly better 
splice site prediction than the other available tools. 

Bayesian kernel was represented for the Support 
Vector Machine (SVM) by Alashwal et al. [24] so as to 
predict protein-protein interactions. By putting together the 
probability characteristic of the existing experimental 
protein-protein interactions data, the classifier performances 
that were amassed from diverse sources could be improved. 
In addition to that, so as to organize more research on the 
highly estimated interactions, the biologists are enhanced 
with the probabilistic outputs that are attained from the 
Bayesian kernel. The results have illustrated that by 
employing the Bayesian kernel when compared with the 
standard SVM kernels, the precision of the classifier has 
been enhanced. Those results have suggested that by means 
of Bayesian kernel, the protein-protein interaction could be 
computed with superior accuracy as when compared to the 
standard SVM kernels. 

IV. PROPOSED DOMINANT GENE PREDICTION 

USING GENETIC ALGORITHM 

Generally, utilization of large gene dataset for 
disease analysis increases the computation time and 
degrades the performance of the process. Hence, a technique 
that requires less computational time to predict dominant 
genes is essential. Hence, an efficient technique is proposed 
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to predict the dominant genes of cancer (either AML or 
ALL) from a microarray gene dataset. The three phases 
involved in the proposed technique are generation of 
training dataset, training through neural network and genetic 
algorithm based dominant gene prediction. Preprocess of 
dominant gene prediction process is illustrated in Fig. 1 and 
the feed forward neural network is depicted in Fig. 2. 

A Preprocess for dominant gene prediction 

The pre processing steps for predicting dominant 
genes are explained in the following steps. 



Microarray gene expression data 



Generation of possible combination 



T 



Dimensionality reduction using PPCA 



Design FF-ANN for classification 



Fig.l preprocessing steps for dominant gene prediction 

1) Generation of training dataset 

In this phase, in order to generate the training set 
for the ANN, it is essential to generate the possible 
combinations of the gene dataset. The two processes 
involved in the generation of training dataset are generation 
of possible combinational data and dimensionality reduction 

Possible combinational data are generated by 
classifying the microarray gene dataset with a lot of 
combinations within the dataset. This combinational data is 
generated with the intention of making easier the learning 

process for dominant genes prediction. Let M z -; be the 
microarray gene dataset, where < z < N s — 1 
and < j < Nq — 1. Here, N s represents the number of 
samples and N q represents the number of genes and the 

size of Mfj is given by N s xNq. The number of 
possible combinational data is calculated as follow, 



Na of possibleoombinatins = 



((N s xN)-k)\k\ 



(1) 



2) Dimensionality reduction by PPCA 

The dimension of the M r .. must be reduced for 

the upcoming processes. The dimensionality reduction is 
done utilizing the probabilistic Principal Component 

Analysis (PC A) and the high dimensional M c . 



'!/ 



was 



converted to low dimension. The dimensionality reduced 
data is utilized as the training dataset for the neural network. 
We reduce the dimensionality using PPCA, which is a PCA 
that has a probabilistic model for the data. The PPCA 
algorithm which was composed by Tipping and Bishop [25] 
utilizes a rightly formed probability distribution of the 
higher dimensional data and calculates a low dimensional 
representation. 

The instinctive attraction of the probabilistic 
representation is because of the fact that the definition of the 
probabilistic measure allows comparison with other 
probabilistic techniques, at the same time making statistical 
testing easier and permitting the utilization of Bayesian 
methods. By making use of PPCA as a generic Gaussian 
density model dimensionality reduction can be achieved. 
Efficient computation of the maximum-likelihood estimates 
for the parameters connected with the covariance matrix 
from the data principal components is facilitated through 

dimensionality reduction. The combinational data M c .. of 



dimension 



N s x N g is 



y 



reduced through the PPCA to 



M, 



of dimension N Q xN r 



In addition to 



~ij * 9 

dimensionality reduction, the PPCA finds more practical 
advantages such as finding missing data, classification and 

novelty detection [26]. Thus training dataset M c .. for the 



ANN is generated with reduced dimension N s 






B. Training phase: Training through Feed Forward ANN 

The proposed technique incorporates a multilayer 
feed forward ANN with back propagation for predicting the 
dominant genes of the AML/ALL cancer. A feed-forward 
network maps a set of input values to a set of output values 
and can be thought of as the graphical representation of a 
parametric function. The dimensionality reduced microarray 
gene dataset is utilized for training the feed forward Neutral 
network with back propagation. 



The combinational data M c .. has a high dimension of 



N s x N q which has to be reduced so as to be utilized in 



further processing. 



The single network N is trained in our proposed 
approach; the network is for receiving the dimensionality 
reduced gene dataset, and outputs the gene value whether it 
is ALL/AML. Hence, the network is configured with 

N input units and hidden and an output unit. 
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Step 1: As the first step, set the input weights of every 
neuron, apart from the neurons in the input layer. 

Step 2: A neural network with N input layers, a 
N hidden layers and an output layer are designed. In this 
neural network, N s (dimensionality reduced) input neurons 
and a bias neuron, N hidden neurons and a bias neuron 
and an output neuron y { are presented. 



the remaining layers (hidden and output layer, but with the 
number of hidden and output neurons, respectively). The 

output of the ANN is determined by giving it M c as the 

input. 

Step 5: The learning error is determined for the NN as 
follows 



N s -1 



N s b=0 



(5) 



Step 3: The designed NN is weighted and biased. The 
developed NN is shown in the Fig. 2. 

Step4: The basis function and the activation function which 
is chosen for the designed NN are shown below. 




Fig 2. n Inputs one output Neural Network to train the gene 
dataset 



Ng-l 

Yi=a+ £ w tJ M c 0<i<i\r;'-l 
1 



g(y)= v 

1 + e y 

g(y) = y 



(2) 

(3) 
(4) 



Eq.(2) is the basis function for the input layer, where M c is 
the dimensionality reduced microarray gene data, W-- is the 

weight of the neuron and a is the bias. The sigmoid 
function for the hidden layer is given in Eq.(3) and the 
activation function for the output layer is given in Eq.(4). 
The basis function given in Eq. (1) is commonly used in all 



Here, E is the error in the FF-ANN, D is the desired 
output and Y b is the actual output. 

1) Minimization of Error by BP algorithm 

The steps involved in training BP algorithm based NN is 
given below. 

a) Randomly generated weights in the interval [0,lj are 

assigned to the neurons of the hidden layer and the output 
layer. But all neurons of the input layer have a constant 
weight of unity. 

b) In order to determine the BP error using Eq. (5), the 
training gene data sequence is given to the NN. Eq. (2), Eq. 
(3) and Eq. (4) show the basis function and transfer 
function. 

c) The weights of all the neurons are adjusted when the BP 
error is determined as follows, 



W = w + Aw 

ij i] i] 



(6) 



The change in weight Aw ; given in Eq. (6) can be 
determined as Aw- = y.y~.E , where E is the BP error 
and y is the learning rate, normally it ranges from 0.2 to 



0.5. 



d) After adjusting the weights, steps (b) and (c) are repeated 
until the BP error gets minimized. Normally, it is repeated 
till the criterion, E < 0.1 is satisfied. 

When the error gets minimized to a minimum value 
it is construed that the designed ANN is well trained for its 
further testing phase and the BP algorithm is terminated. 
Thus, the neural network is trained by using the samples. 
Then to determine the dominant genes of the ALL/AML 
cancer the genetic algorithm is applied. 
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C. Testing phase: Genetic Algorithm based dominant gene 
prediction ofAML/ALL cancer 

In the training phase, by means of the training 
dataset the FF-ANN is designed and the well trained 
network is utilized for predicting the dominant genes in an 
efficient manner. The genetic algorithm is applied on the 
classified test sequence and then this test sequence is 
evaluated and the dominant genes are predicted. In this GA 
based dominant gene prediction, initially, the random 
chromosomes are generated. The random chromosomes are 
the indices of the test sequence which are classified as 
ALL/AML. The genes are generated without any repetition 
within the chromosome. After generating the chromosomes, 
the fitness is calculated by providing the genes of the 
chromosome which are the indices as input to the designed 
FF-ANN. Then, by subjecting the chromosomes to the 
genetic operations, crossover and mutation, newly generated 
chromosomes are obtained. Then the fitness is determined 
for the newly generated chromosomes. The generated new 
chromosomes are given as input to the designed FF-ANN. 
The optimal chromosomes are obtained by analyzing the 
threshold value. The process is repeated until optimal gene 
values are obtained. The process of genetic algorithm to 
predict the dominant gene is depicted in fig. 3 



f Generate Initial chno mo somes 



j- 



Calculate the fitness through 
the FF-ANN 



Crossover 

— r~ 



Mutation 



Select the best chromosomes 




Obtain best chromosomes 



Obtain the genes indexed by 
the best chromosomes 



Fig 3. Proposed genetic algorithm for dominant gene prediction 

1) Generation of chromosomes 

Initially generate N p number of random 
chromosomes and the number of genes in each chromosome 



relies on N i.e. number genes in the training dataset. As 

discussed earlier, the generated genes are the indices of the 
test input sequence. 



D (k) = J D (k) D (k) D (k) _ D ( k )\ 

0<k<N p -1 0</<n-l 

n- Number of genes in the training dataset. 



(?) 



In eq.7, D, represents the V gene of the k l 

chromosome. These genes are generated without any 
repetition within the chromosomes. Once the 

N p chromosomes are generated then the fitness function is 
applied on the generated chromosomes 

2) Fitness Function 

The fitness of the generated chromosomes is 
evaluated using the fitness function by giving the 
chromosomes as input to the designed FF-ANN. 



N p -1 



Mnet - 



k=0 



(8) 



N pt = 



(!-vO C 
c = if test sequence is ALL 

c = 1 if test sequence is AML 



(9) 



In Eq. (8), JV 0Ut is the network output obtained from the 

FF-ANN for the k chromosome and iV\ t in Eq. (9) is the 
fitness value of the initially generated chromosomes. 

3) Crossover and Mutation 

The two point crossover is chosen with the 
crossover rate ofC^ amid diverse kinds of crossovers. 
Using eq. (10) and (11) two points are selected on the parent 
chromosomes in the two point crossover. The genes that are 

present in between the two points cr x and cr 2 are 
exchanged among the parent chromosomes, hence N 
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children chromosomes are attained. The crossover points 
cr x and cr 2 are determined as follows 



dimension 



cr 1 = 



cr n = 



m_ 2 

3 

m +2 

2 



(10) 

(11) 



The children chromosomes are acquired now and their 
corresponding gene values are store discretely and their 

corresponding indices from theD, are stored inD new/ . 

Subsequently mutation is executed by employing Eq. (9) on 
the chromosomes that are obtained after crossover. Then, by 

reinstating N m number of genes from every chromosome 
with new genes, mutation is achieved. The N m numbers of 

gene are just genes, which have the least N oul (as 

determined from the Eq. (9)). The arbitrarily generated 
genes are the replaced genes devoid of any recurrence 
within the chromosome. Then, the selected chromosomes 
for crossover operation, and the chromosomes which are 
obtained from mutation are combined, hence the population 

pool is filled up with the N chromosomes. Then, until a 

maximum iteration of I max is reached this process is 
repeated iteratively. 

4) Selection of optimal solution 

The best chromosomes are selected from the group 
of chromosomes that is obtained after the process is 

repeated J max times. Here, the best chromosomes are the 

chromosomes which have minimum fitness for both 
ALL/AML which may depend upon the c value. The 
obtained best chromosomes are used to retrieve the 
corresponding gene values from the test sequence. The gene 
values of the ALL/AML cancer represented by the indices, 
which are obtained from the genes of the best chromosomes, 
are the dominant genes of the ALL/AML and they are 
retrieved in an effective manner. 

V. IMPLEMENTATION RESULTS AND 

DISCUSSION 

The proposed dominant gene prediction technique 
is implemented in the MATLAB platform (Version 7.10) 
and it is evaluated using the classified microarray gene 
expression data of human acute leukemias. The standard 
leukemia dataset for training and testing is obtained from 
[26]. The training leukemia dataset is of dimension 

N q = 7192 and N s = 38 . This dimension of the dataset 

is too high to train the FF-ANN and hence its dimension is 
reduced using PPCA and then the training dataset of 



N g = 30 and 



N s = 38 



is obtained. This 



training dataset is utilized to design the FF-ANN and then 
the test input sequence is tested through the genetic 
algorithm. The selected double point crossover points are 

cr x = 8 and cr 2 = 22 with a crossover rate C R = 0.5 
and for mutation iV m =5. After the completion of the 

crossover and mutation operations, based on the conditions 
given in section 4, the optimal chromosomes were obtained. 
These optimal chromosomes are the indices of the ALL 
cancer test sequence. This process is repeated until it 

reaches the maximum iteration J max = 20. The training of 

FF-ANN is implemented using the Neural Network Toolbox 
in MATLAB. Fig 4 shows the Regression of the designed 
FF-ANN and the Fig 5 shows the performance of the 
designed FF-ANN. Fig 6 depicts the performance of the 
ALL test sequence during the testing process and the Fig 7 
depicts the performance of the AML test sequence during 
the testing process. 
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Figure 4: Regression output of the designed FF-ANN 
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Figure 5: Performance of BP in training the designed FF-ANN 
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Figure 6: The performance of ALL during the testing process 
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Figure 7: The performance of AML during the testing process 

Once the training process of the FF-ANN is 
completed, the input sequence either ALL or AML is tested 
through the genetic algorithm and then the dominant gene of 
either ALL or AML has been obtained. In Fig 6, the 
performance of the ALL input sequence has been tested and 
the obtained dominant gene based on some criteria 
(mentioned in the section 4) is depicted differently from the 
regular genes. In Fig 7, the performance of the AML input 



sequence has been tested and the obtained dominant gene 
based on some criteria (mentioned in the section 4) is 
depicted differently from the regular genes. The table 1 
demonstrated the dominant genes of the ALL and AML 
below 



ALL 


AML 


Indices 


Dominant 
Genes 


Fitness 
by FF- 
ANN 


Indices 


Dominant 
Genes 


Fitness 
by FF- 
ANN 


6041 


1284 


0.4467 


3196 


-162 


2.2381 


6378 


-231 


647 


119 


3845 


-11 


1024 


12450 


5764 


36 


2269 


757 


3267 


390 


4108 


177 


518 


1396 


1036 


910 


6485 


62 


1077 


1361 


3756 


-482 


4763 


3381 


3812 


251 


1905 


118 


4122 


-16 


3790 


148 



Table 1. The indices of dominant genes, dominant genes and their fitness 



VI. 



CONCLUSION 



In this paper, an effective genetic algorithm based 
method to predict the dominant genes in the ALL/AML 
dataset was discussed. The proposed technique, instead of 
analyzing the entire database, analyzed only the dominant 
genes and hence it has provided the optimal results. The FF- 
ANN was designed by means of training samples to assess 
the test sequence in the proposed genetic algorithm. Then, 
the fitness of the test sequence samples was evaluated 
through the designed FF-ANN. After that, the test input 
sequence was evaluated and the dominant genes were 
predicted through the genetic algorithm. The obtained 
fitness of the ALL dominant genes through the FF-ANN is 
0.4467 and for AML dominant genes is 2.2381. Table 1 
demonstrated the dominant genes of the ALL and the AML. 
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Abstract :- Chronic disease anemia [1] occurs when blood 
doesn't have enough hemoglobin. Hemoglobin is a protein in red 
blood cells that carries oxygen from lungs to the rest of our 
body. All the body parts need oxygen. Anemia can starve our 
body of the oxygen it needs to survive. Possible causes of anemia 
include low vitamin B 12 or folic acid intake and some chronic 
illnesses. But the most common cause is not having enough iron 
in blood which needs to make hemoglobin. This type of anemia 
is called iron deficiency anemia. 

Data Mining is widely used in database communities because 
of its wide applicability. One major application area of Data 
Mining is in therapeutic diet prediction. There are several 
chronic diseases which can be prevented using nutritive food. 
This paper presents association and correlation between anemia 
human subject and its prevention through diet nutrients. The 
role of diet in preventing and controlling iron deficiency is 
significant. Due to changes in dietary and life style patterns 
anemia can become catastrophic, so by predicting proper and 
sufficient diet nutrients for individuals, we can reduce the 
impact of anemia on human subjects. 

Keywords :- Chronic Disease, Anemia, Diet Nutrients, Clinical 
System, Correlation, Data Mining 

i. Introduction 

Data Mining is referred to as Knowledge Discovery from 
databases[5], a process of nontrivial extraction of implicit previously 
unknown and potentially useful information from databases, has 
wide application in information management concepts, query 
processing, decision making, process control, statistical analysis etc. 
[4], [5] An association rule mining is an important process in data 



mining, which determines the correlation between items belonging 
to transaction database [6], [7]. 

Chronic disease is a diseasefl 0] that is long lasting for recurrent. 
Anemia also becomes a chronic disease if it is not cured timely. It is 
prolonged, do not resolve spontaneously and are rarely cured 
completely. If anemia results from a diet which is low in iron, iron 
rich foods or iron pills may be the doctor suggests. Data Mining 
technology can be utilized for improving the quality of health care 
of an individual. There is need for lowering the cost of health care 
facilities along with quality based treatment especially for poor 
sections of our society. 

Health care organization need to lower cost, raise quality and still 
remain competitive. IT can be used for patient health care. In the IT 
driven society the role of IT in health care is well established. We 
can use data mining techniques for analyzing patient data to 
generate predictions and knowledge which can in turn be used for 
fast and better clinical decision making. Medical data mining deals 
with large amount of data, there fore we need data mining 
techniques, which can explore hidden patterns in the data sets of the 
medical domain. These patterns can be utilized for clinical diagnosis 
and prediction. 

The available raw medical data are widely distributed, 
heterogeneous in nature and voluminous, therefore these data need 
to be collected in an organized form. This collected data can be then 
integrated to form as a basis for prediction of nutrients chronic 
diseases. Data mining technology provides a user oriented approach 
to extract hidden patterns from the data. Data mining can deal with 
heterogeneous type of data. Medical data are different from data in 
other databases. Medical data are heterogeneous and contain text 
and images also, for example MRI , ECG, etc generate huge 
amounts of heterogeneous medical data. Knowledge discovery from 
this type of data can greatly benefit mankind by improved diagnostic 
techniques. 
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A. Background 

Diet and nutrition are important factors in the promotion and 
maintenance of good health throughout the entire life course. The 
chronic diseases related to diet and nutrition are anemia, obesity, 
diabetes, cardiovascular disease, cancer, osteoporosis etc. Anemia 
can be defined as a reduction in the hemoglobin, hematocrit or red 
cell number [10]. The sudden, rapid loss of 30% of the total blood 
volume often results in death. The burden of chronic disease is 
rapidly increasing world wide .It has been calculated that in 2001, 
chronic diseases contributed approximately 60% of the 56.5 million 
total reported deaths in the world and approximately 46% of the 
global burden of disease. 

The chronic disease problem has effected a large proportion of 
our population. It has been projected that by 2020 chronic diseases 
will account for almost three quarters of all deaths worldwide are 
due to heart diseases. Chronic diseases are largely preventable 
disease. Research has shown that diet can control chronic diseases to 
large extent[ll]. Modern dietary patterns and physical activity 
patterns are beneficial in prevention of chronic diseases. The 
chemical composition of food and physiological response to diet can 
be discovered. Unraveling the interconnection between diet and 
health through data mining techniques is the basis of this paper 
identification of disease and diet associated composite molecular 
biomarkers will facilitate identification of new molecular targets for 
development of novel therapeutic agents to address diet related 
chronic diseases . 

B. Data Mining And Statistics 

Data Mining is designed[2] to learn future prediction. The Data 
Mining tool checks the statistical significance of the predicted 
patterns and reports. The difference between Data Mining and 
statistics is that Data Mining automates the statistical process 
requiring in several mining tools. Statistical inference is assumption 
driven in the sense that a hypothesis is formed and tested against 
data. Data Mining in contrast is discovery driven. That is, the 
hypothesis is automatically extracted from the given data. The other 
reason is Data Mining techniques tend to be more robust for real- 
world data and also used less by expert users. 

C. Statistical Correlation 

Through correlation[3] we may find that a change in one variable 
result in change of second variable. 

Whenever there exists a relationship between two variable such 
that a change in one variable results in a positive or negative change 
in other and also greater change in one variable results in a 
corresponding greater change in the other, the relationship is called 
correlation and the two variables are called correlated. 

Two variables are called positively correlated if corresponding to 
an increase (or decrease) in one variable results in an increase (or 
decrease) in the other. 

Two variables are negatively correlated if corresponding to an 
increase (or decrease) in one variable results in decrease (or 
increase) in the other. 



ii. Association and correlation to build 

RELATIONSHIP 

In this paper we are dealing with two distinct systems, anemia 
human subject and diet prediction. Based on the level of hemoglobin 
of individual, we have made an attempt to associate human subject 
and iron requirement. It has been found that average range of 
hemoglobin for males is 14-16 and for females is 12-14. The iron 
requirement for average range of hemoglobin individual according 
to RDA (Recommended Dietary Allowances) table is 28mg for 
males and 30mg for females. The table given below depicts the fact 
that the iron requirement of females is more than that of males. We 
have used the correlation statistical technique for finding the 
relationship between both the systems. 

In every case of anemia[9], the cause should be discovered and 
treated. In clinical practice, nutritional anemia is commonly 
associated with overall under-nutrition and a balanced diet should be 
given. Usually, diet alone is not adequate and therapy with specific 
supplements - particularly iron - is also needed. Supplements of 
lOmg (179 micro mol) iron daily prevent iron deficiency. 
Association rule mining [5] uses support confidence framework. 
Diet can cure anemia, this statement can be justified through support 
confidence measure. We can augment support confidence 
framework through correlation. The Bayesian correlation technique 
has been used to find the correlation between required level of iron 
(in mg) and range of hemoglobin of individuals. 

III. EXPERIMENTAL RESULTS 

Sufficient safe and varied food supplies not only prevent 
malnutrition but also reduce the risk of chronic diseases whereas 
nutritional deficiency increases the risk of common infectious 
diseases generally in children. This paper has used data mining 
techniques to investigate factors that contribute significantly in 
reducing the risk of chronic disease anemia. 



There are 
variables: 

(i) 
(ii) 



two methods of analyzing correlation between 



Karl Pearson's Method 
Scatter Diagram Method 



(i) Use of Karl Pearson 's method to find coefficient of correlation: 

Correlation is the relationship between two or more than two 
variables. We are finding correlation between anemia human 
subjects(male and female) and predicted diet. Anemia is a chronic 
disease caused due to several factors. Among those factors 
inadequate and malnutrition is a major cause of anemia. We have 
generated the following correlation Table 1 for male anemia subject 
with mid value of hemoglobin range(X m ) and iron intake (Y m mg):- 



Hb range: 



Iron intake: 



Table 1. 
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£x*y = 
-101.32 



Table 2. 



M Xffi =£X m In =48/5 = 9.6 

M Yffi =2;Y ffi In =171/5 = 34.2 

The coefficient of correlation for male anemia subjects: 

r = £x*yW£x 2 *£y 2 
= -101.32/V(91.2)*(113.16) 
= -101.32/V (10320.192) 
= -101.32/101.5883 
= -0.9974 



(ii) Scatter Diagram Method: The diagrammatic representation of a 
bivariate data is known as scatter diagram. For bivariate the values 
of variables X and Y are plotted (male and female) in the X-Y plane. 
X axis denote range of hemoglobin and Y axis denotes iron required 
for individuals. 

Figure 1 
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M xf =IX f /n =33/4 = 8.25 
M Yf =IY,/n =146/4 = 36.5 

The coefficient of correlation for female anemia subjects: 

r = £x*yW£x 2 *£y 2 
= -83.5/V(54.74)*(129) 
= -83.5/V (7061.46) 
= -83.5/84.0325 
= -0.9937 
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In this scatter diagram all the points lie on a line and it declines 
from left to right, this depicts the fact that there is a negative 
correlation of high degree between the variables, i.e. if the 
hemoglobin count of individual is less than the required iron is 
more. 

We have generated the following correlation Table 1 for female 
anemia subject with mid value of hemoglobin range(X) and iron 
intake (Ymg):- 



Hb range: 
Iron intake: 
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Figure 2 
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If measure of correlation is between -0.75 and -1, negative then 
measure of correlation is of high degree. 

Both the methods have justified the same fact that there is 
negative correlation of high degree between hemoglobin range and 
required iron. 

iv. Conclusion and future scope 

Data Mining can be performed by dietitians and hospital 
administration to prepare diet chart for anemic patients. The goal is 
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to detect when the Hb percent of patient is very low the diet 
predicted should contain food high in iron - such as seafood, dried 
fruits like apricots, prunes, raisins, nuts, beans, green leafy 
vegetables, whole grains etc.. 

This research has revealed the fact that anemia human subjects 
should be recommended food containing high percentage of iron and 
those having normal range of hemoglobin can take food containing 
average percentage of iron. 

There exist negative correlation[3] between diet high in iron and 
Hb% of patient. If Hb% of patient is low, diet should be rich in iron 
and if the reverse condition exist i.e. Hb% of patient is high, diet can 
contain average percentage of iron. 

This paper depicts the correlation between anemia human subject 
and diet intake of individual. Anemia is common disease effecting 
large masses of people and if it is not cured timely, it becomes 
catastrophic. Organisation for social services can use the above 
result for preventing anemia of particular area. There are several 
areas which suffer from some kind of deficiencies such as Calcium 
deficiency is common in Bhopal, Iodine deficiency in Kannur, 
Purulia Distt, West Bengal, Sickle Cell anemia in tribal population 
of Maharashtra, Gujrat, Orissa and Tmilnadu etc.. They can use 
above conclusion to predict diet for anemia affected areas. Data 
mining deals with large data set, suppose we consider total human 
population of any area and we can classify the population into male 
and female data sets. In areas where iron deficiency is common the 
hemoglobin range of individuals will be below 8%. 

This paper has used the correlation concept of statistics to depict 
and justify the fact that there is high degree of correlation between 
hemoglobin range of individual and iron intake. By using this fact 
organizations can plan diet patterns for areas facing the problem of 
anemia. The entire population of that area can be prescribed iron 
rich food. Dietitians can also use this fact for preventing this 
common disease of particular geographical area. There are several 
remote areas in India which suffer from this common disease. 
Organisations can directly plan nutrients for those needy people 
without the need for performing unnecessary calculations. This 
result will be beneficial for large masses of people. They can be 
prevented from this chronic disease by giving iron rich food, so the 
above results are beneficial for our society. 
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Abstract — This Refactoring is the process of changing a software 
system aimed at organizing the design of source code, making 
the system easier to change and less error-prone, while 
preserving observable behavior. This concept has become 
popular in Agile software methodologies, such as extreme 
Programming (XP), which maintains source code as the only 
relevant software artifact. Although refactoring was originally 
conceived to deal with source code changes. 

Two key aspects of extreme Programming (XP) are unit 
testing and merciless refactoring. We found that refactoring test 
code is different from refactoring production code in two ways: 
(1) there is a distinct set of bad smells involved, and (2) 
improving test code involves additional test code refactorings. we 
describe a set of code smells indicating trouble in test code and a 
collection of test code refactorings explaining how to overcome 
some of these problems through a simple program modification. 

Keywords- Test Smell, Test Case, Refactoring, Unit Testing, 
Object Oriented, TDD. 

I. Introduction (Heading 1) 

Computer software is an engine of growth of social- 
economy development which requires new techniques and 
strategies. The demand for quality in software applications has 
grown. Hence testing becomes one of the essential components 
of software development which is the indicator of quality [4]. 

"Testing proves the presence, not the absence of bugs" 
-- E.W.Dijkstra 

The unit test provides the lowest level of testing during 
software development, where the individual units of software 
are tested in isolation from other parts of program/software 
system. Automated Testing is the other program that runs the 
program being tested, feeding it with proper input, and thus 
checking the output against the expected. Once the test case is 
written, no human intervene is needed thus the test case does 
all andindicate[12]. 

Adequate testing of software trials prevent this tragedies 
to occur. Adequate testing however, can be difficult if the 
software is extremely large and complex. This is because the 
amount of time and efforts required to execute a large set of 
test cases or regression test cases be significant [3]. Therefore, 



the more testing can be done with accuracy of test cases which 
assist in corresponding rise in program transformation. 

Amongst different types of program transformation, 
behavior-preserving source- to- source transformations are 
known as refactorings [2]. Refactoring is the process of 
changing a software system in such a way that it does not alter 
the external behavior of the code yet improves its internal 
structure [8]. 

The refactoring concept was primarily assigned to 
source code changes. The refactoring of test case may bring 
additional benefits to software quality and productivity, vis-a- 
vis cheaper detection of design flaws and easy exploration of 
alternative design decisions. Consequently, The term code 
refactoring and test case refactoring can be made distinct. Thus, 
one of the main reasons for wide acceptance of refactoring as a 
design improvement technique and its subsequent adoption by 
Agile software methodologies, in particular eXtreme 
Programming (XP) [1]. The XP encourages the development 
teams to skip comprehensive initial architecture or design 
stages, guiding them its implementation activities according to 
user requirements and thus promoting successive code 
refactorings when inconsistencies are detected. 

II. Test -Driven Development 

Test Driven Development (TDD) is the core part of the 
Agile code development approach derived from eXtreme 
Programming (XP) and the principles of the Agile 
manifesto. It provides to guarantee testability to reach an 
extremely high test coverage, to enhance developer confidence, 
for highly cohesive and loosely coupled systems, to allow 
larger teams of programmers to work on the same code 
base, as the code can be checked more often. It also 
encourages the explicitness about the scope of 
implementation. Equally it helps separating the logical and 
physical design, and thus to simplify the design, when only the 
code needed. 

The TDD is not a testing technique, rather a 
development and design technique in which tests are written 
prior to the production code. The tests are added its gradually 
during its implementation and when the test is passed, the 
code is refactored accordingly to improve the efficacy of 
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internal structure of the code. The incremental cycle is 
repeated until all functionality is implemented to final. 

The TDD cycle consists of six fundamental steps: 

1 . Write a test for a piece of functionality, 

2. Run all tests to see the new test to fail, 

3. Write corresponding code that passes these tests, 

4. Run the test to see all pass, 

5. Refactor the code and 

6. Run all tests to see the refactoring did not change the 
external behavior. 

The first step involves simply writing a piece of code to 
ensure the tests of desired functionality. The second is required 
to validate the correctness of test, i.e. the test must not pass at 
this point, because the behavior under implementation must not 
exist as yet. Nonetheless, if the test passes over, means the 
test is either not testing correct behavior or the TDD principles 
have not been strictly followed. The third step is the writing of 
the code. 



Write a Test 



Run the Test 




Modify Code 



Remove 
Bug 



Run the Test 



Eails. 




Passes 



► Refactor 



Run the Test 



-Fails- 




Figure - 1 



However, it should be kept in mind to only write as little 
code as possible to enable to pass the test. Next, step is to see 
that the change has not introduced any of the problems 
somewhere else in the system. Once all these tests are passed, 
then the internal structure of the code should be improved by 
refactoring. The afore mentioned cycle is presented in Fig 1. 

III. Refactoring 

The Program restructuring is a technique for rewriting 
software may be useful either for legacy software as well as for 
the production of new systems[ 6, 8,11]. If the internal 
structure is changed, although the behavior (what the program 
is supposed to do) is maintained. Restructuring re-organizes the 
logical structure of source code in order to improve specific 
attributes [8] or to make it less error-prone when future changes 
are introduced [11]. 

Behavior preserving program changes are known as 
refactorings which was introduced by Opdyke [10]. Yet it's 
gaining importance by Fowler's work [5] and eXtreme 
Programming (XP) [1], an Agile software development in 
context of object-oriented development. In this context, a 
refactoring is usually composed of a set of small and atomic 
refactorings, after which the largest source code is better than 
the original with respect to particular quality attributes, such as 
readability and modularity. 

Thus, refactoring can be viewed as a technique for software 
evolution through-out software development and maintenance. 
Software evolution can be classified into the following types 
[7]: 

- Corrective evolution: correction of errors; 

- Adaptive evolution: modifications to accommodate 
requirement changes; 

- Perfective evolution: modifications to enhance existing 
features. 



Refactoring is mostly applied in perfective software 
evolution, though it also affects corrective and adaptive 
evolution. First, well-organized and flexible software allows 
one to quickly isolate and correct errors. Second, such software 
ensures that new functionality can be easily added to address 
changing user requirements. 

A known issue about refactoring is automatization. 
Small steps of refactoring have usually been performed 
manually, using primitive tools such as text editors with search 
and replace functionality. This situation eventually leads to 
corrupt the design of source code, mostly due to the fact that 
manual refactoring is tedious and prone to errors [2]. Although 
the choice of which refactoring to apply is naturally made by 
human, automatic execution of refactorings might result in a 
major improvement in productivity. 

In addition, concerning behavior preservation, TDD 
informally guides refactoring assisted by unit tests, increasing 
the correctness of a sequence of transformations. Furthermore, 
verification of object-oriented programs is highly nontrivial. A 
number of recent research initiatives have pointed out 



97 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



(IJCSIS) International Journal of Computer Science and Information Security, 
Vol. 8, No. 8, November 2010 



directions for formally justifying refactorings. In Opdyke's 
work, preconditions for refactorings are analyzed [10], whereas 
Robert's work formalizes the effect of refactorings in terms of 
postconditions, in order to build efficient refactoring tools [2]. 
In contrast, Mens [9] apply graph representation to aspects that 
should be preserved and graph rewriting rules as a formal 
specification for refactorings. 

IV. Causes of Refactoring 

In computer programming, code smell is any 
symptom in the source code of a program that possibly 
indicates a problem at steep level. 

Often the deeper problem hinted by a code smell can 
be uncovered when the code is subjected to a short 
feedback cycle where it is refactored in small, controlled 
steps, and the resulting design is examined to assist the 
needs of more refactoring. From the programer's point of 
view, code smells are forecast to refactor, and what 
specific refactoring techniques are to be used. Thus, a 
code smell is a driver for refactoring. Code smell hint that 
provides can be improved in some where in your code. 

Determining a code smell is often a subjective 
judgment and will often vary by language, developer and 
its methodology. There are certain tools, such as 
Checkstyle, PMD and FindBugs for Java, to automatically 
evaluate for certain kinds of code smells. 

When to apply refactorings to the test code, is different 
from refactoring production code and the test code has a 
distinct set of smells dealing with the test cases are organized, 
to study its implementation and interaction with each other. 
Moreover, improving test code involves a mixture of 
refactorings from specialized to test code improvements as 
well as a set of additional refactorings involving the 
modification of test classes, ways of grouping test cases, and 
soon[5]. 

Refactoring (to Patterns) 

• Simple Design -> Code Smell -> Refactor 

• Refactoring ( to Patterns) is the ability to transform a 
"Code Smell" into a positive design pattern 

Following are the examples of some of the Bad Code 
Smells that are encountered in case (unit/class) design 

• Duplicated Code 

• Methods too big 

• Nested "if statements 

• Classes with too many instance variables 

• Classes with too much code 

• Strikingly similar subclasses 

• Too many private (or protected) methods 

• Similar looking code sections 



• Dependency cycles 

• Passing Nulls To Constructors 

• Classes with too little code 



V. Test Case Code Smells 

This section gives an overview of bad code smells that are 
specific for test code. 

A. Self Contained 

When a test uses external resources, such as file containing 
test data, the test is no longer self contained. Consequently, 
there is no enough information to understand the test 
functionality, to use it as test documentation. 

Moreover, external resources introduces hidden 
dependencies: if some force mutates such a resource, tests start 
failing. Chances for this increase becomes more when more 
tests use the same resource. The use of external resources can 
be thus eliminated using refactoring Intregral Resource. 

B. Resource Optimism 

Test code that makes optimistic assumptions about the 
existence (or absence) and state of external resources (such as 
particular directories or database tables) can cause 
nondeterministic behavior in test outcomes. The situation 
where tests run fine at one time and fail miserably at the other 
time needs to be avoided. Resource Allocation refactoring used 
to allocate and/or initialize all resources that are to be used. 

C. Resource Interface 

Such wars arise when the tests execute you are the only one 
testing which fails when more programmers run them. This is 
most likely caused by Resource Interference: some tests in your 
suite allocate resources such as temporary files that are also 
used by others. Identified Uniquely is one of the test code 
refactoring method used to overcome Resource Interference. 

D. Setup Method 

In the JUnit framework a programmer can write a setUp 
method that can be executed before each test method to create a 
fixture for the tests to run. Things start to smell when the setUp 
fixture is too general and different tests only access part of the 
fixture. Such setUps are harder to read and understand. 

Moreover, they may make tests run more slowly (because 
they do unnecessary work). The danger of having tests that take 
too much time to complete is that testing starts interfering with 
the rest of the programming process and programmers 
eventually may not run the tests at all. 

E. Splitting Method 

When a test method checks methods of the object to be 
tested, it is hard to read and understand, and therefore more 
difficult to use as documentation. Moreover, it makes tests 
more dependent on each other and harder to maintain. The 
solution is simple: separate the test code into test methods that 
test only one method. Note that splitting into smaller methods 
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which can slow down the tests due to increased setup/teardown 
overhead. 

F. Assertion Roulette 

"Guess what's wrong?" This smell comes from having a 
number of assertions in a test method that have no explanation. 
If one of the assertions fails, it becomes difficult to know the 
cause of concern. Use Asertion Explanation to remove this 
smell. 

G. Class-to-be-tested 

A test class is supposed to test its counterpart in the 
production code. It starts to smell when a test class contains 
methods that actually perform tests on other objects (for 
example because there are references to them in the class-to- 
be-tested). The smell which arises also indicates the problems 
with data hiding in the production code. Note that opinions 
differ on indirect testing. Some people do not consider it a 
smell but a way to guard tests against changes in the "lower" 
classes. We feel that there are more losses than gains to this 
approach: It is much harder to test anything that can break in an 
object from a higher level. Moreover, understanding and 
debugging indirect tests is much harder. 

H. Duplication across Test Class 

Test code may contain undesirable duplication. In particular 
the parts that set up test fixtures are susceptible to this problem. 
Solutions are similar to those for normal code duplication as 
described by Fowler [5]. The most common case for test code 
will be duplication of code in the same test class. For 
duplication across test classes, it may prove helpful to mirror 
the class hierarchy of the production code into the test class 
hierarchy. A word of caution however can introduce 
dependencies between tests moving duplicated code from two 
separate classes to a common class. 

A special case of code duplication is test implication: 
test A and B cover the same production code and A fails if and 
only if B fails. A typical example occurs when the production 
code gets refactored before such refactoring. 

VI. Test Code Refactoring 

Bad smell seems to arise more often in production code 
than in test code. The main reason for this is that, production 
code is adopted and refactored more frequently allowing these 
smells to escape. 

One should not, however, underestimate the importance of 
having fresh test code. Especially when new programmers are 
added to the team or when complex refactorings need to be 
performed clear test code is invaluable. To maintain this 
freshness, test code also needs to be refactored. We define test 
refactorings as changes (transformations) of test code that: (1) 
do not add or remove test cases, and (2) make test code better 
understandable/readable and/or maintainable. The production 
code can be used as a (simple) test case for the refactoring: If a 
test for a piece of code succeeds before the test refactoring, it 
should also succeed after the refactoring. This, obviously also 
means that you should not modify production code while 
refactoring test code (similar to not changing tests when 
refactoring production code). 



While working on our test code, the following refactorings 
are encountered: 

A. Integral Resource 

To remove the dependency between a test method and 
some external resource, we incorporate the resource in the test 
code. This is done by setting up a fixture in the test code that 
holds the same contents as the resource. This fixture is then can 
be used instead of the resource to run the test. A simple 
example of this refactoring is to put the contents of a file that is 
used into some string in test code. 

B. Resource Allocation 

If it is necessary for a test to rely on external resources, 
such as directories, databases or files, make sure the test that 
uses them explicitly creates or allocates these resources before 
testing and releases them when done (take precautions to 
ensure the resource is also released when tests fail). 

C Identified Uniquely 

Lot of problems originate from the use of overlapping 
resource names; either between different tests run done by the 
same user or between simultaneous test runs done by different 
users. Such problems can easily be overcome using unique 
identifiers for all resources that are allocated, such as including 
a time-stamp. When you also include the name of the test 
responsible for allocating the resource in this identifier, you 
will have less problems finding tests that do not properly 
release their resources. 



D. Minimize Data 

Minimize the data that is setup in fixtures to bare 
essentials. This will have two advantages: (1) in making them 
better suitable for documentation and consequently (2) the tests 
will be less sensitive to changes. 

E. Assertion Explanation 

Assertions in the JUnit framework have an optional first 
argument to give an explanatory message to the user when the 
assertion fails. Testing becomes much easier when you use this 
message to distinguish between different assertions that occur 
in the same test. May be this argument should not have been 
optional. 

F. Add Equality Method 

If an object structure needs to be checked for equality in 
tests, an implementation for the "equals" method for the 
object's class needs to be added. You then can rewrite the tests 
that use string equality to use this method. If an expected test 
value is only represented as a string, explicitly construct an 
object containing the expected value and use the new equals 
method to compare it to the actually computed object. 
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VII. Conclusion 

The large refactoring can improve overall quality of a test 
case using these set of smells choices. The only concern needs 
to be understand the selection of refactoring choices. But which 
refactoring choices should be implemented? We advocates 
program slicing in conjunction with code smell to guide 
refactoring process. By slicing the software system one or more 
bad smells, different refactoring options can examined and 
evaluated using these sets of smells. Thus the combination of 
program slicing and set of code smells guides the refactoring 
process. 

A software system essentially needs the refactoring systems 
for its better performance. Thus this refactoring process assist 
in its high quality and can prove to be more maintainable 
techniques. This refactoring process thus can be executed in 
lower error rates, fewer test cases per module and to increase 
over all understandability and maintainability in return. In both 
the design and maintenance phases, these advantages can be 
realized almost immediately. 
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Abstract-An image containing textual 
information is called a document image. The 
textual information in document images is 
useful in areas like vehicle number plate 
reading, passport reading and cargo container 
reading and so on. Thus extracting useful 
textual information in the document image 
plays an important role in many applications. 
One of the major challenges in camera 
document analysis is to deal with the wrap and 
perspective distortions. In spite of the 
prevalence of dewrapping techniques, there is 
no standard efficient algorithm for the 
performance evaluation that concentrates on 
visualization. 

Wrapping is a common appearance document 
image before recognition. In order to capture 
the document images a mobile camera of 
2megapixel resolution is used. A database is 
developed with variations in background, size 
and colour along with wrapped images, blurred 
and clean images. This database will be 
explored and text extraction from those 
document images is performed. In case of 
wrapped images no efficient dewrapping 
techniques have been implemented till date. 
Thus extracting the text from the wrapped 
images is done by maintaining a suitable 
template database. Further, the extracted text 
from the wrapped or other document images 
will be converted into an editable form such as 



Notepad or MS word document. The 
experimental results were corroborated on 
various objects of database. 
Keywords: Dewrapping, Template Database, 
Text Extraction. 

I. Introduction 

An image may be defined as a two 
dimensional function f(x, y), where x and y are 
spatial co-ordinates and the amplitude of / at 
any pair of co-ordinates(x, y) is the intensity or 
gray level of the image at that point. When x, y 
and the intensity values of /are all finite, the 
digital image is composed of finite number of 
elements where each has a particular location 
and value. These elements are called picture 
elements, image elements, pels and pixels 
[7] [14]. Image processing can be broadly 
categorized into two classes. The first category 
takes images as input and gives the images as 
output. The other category takes images as 
input and gives the attributes of images as 
output. The entire processing can be listed as: 
(i). Image enhancement- It involves 
manipulating an image so that the result is 
more suitable than original for processing, 
(ii). Image restoration- It involves improving 
the appearance of an image based on 
mathematical or probabilistic model of image 
degradation. 

(iii). Colour image processing- Colour can be 
used as factor or basis for extracting features 
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of interest in an image. 

(iv). Compression- This reduces the storage 

required to save an image or bandwidth to 

transmit an image. 

(v). Morphological image processing- It deals 

with the tools for extracting image components 

that are useful in the representation and 

description of shape. 

(vi). Segmentation- It deals with the 

partitioning of an image into constituent parts 

namely autonomous and rugged segmentation 

[7] [4]. 

A document is a bounded physical or 
digital representation of a body of information 
with capacity (and usually intent) to 
communicate. Document image processing and 
understanding has been extensively studied 
over the past 40 years that has carved a niche out 
of the more general problem of computer vision 
because of its pseudo binary nature and the 
regularity of the patterns used as a "visual" 
representation of language. In the early 1960s, 
optical character recognition was taken as one of the 
first clear applications of pattern recognition and 
today, for some simple tasks with clean and well- 
formed data document analysis is viewed as a 
solved problem. Unfortunately, these simple tasks 
do not represent the most common needs of the 
users of document image analysis. The challenges 
of complex content and layout, noisy data and 
variations in font and style presentation keep the 
field active. 

Traditionally, document images are scanned 
from pseudo binary hardcopy paper manuscripts 
with a flatbed, sheet-fed, or mounted imaging 
device. Recently, the community has seen an 
increased interest in adapting digital cameras to 
tasks related to document image analysis. Digital 
camcorders, digital cameras, PCcams, PDA's 
(personal digital assistant) and even cell phone 
cameras are becoming increasingly popular and they 
have shown potential as alternative imaging devices. 



Although they cannot replace scanners, they are 
small, light, easily integrated with various networks 
and more suitable for many document capturing 
tasks in less constrained environments. These 
advantages are leading to a natural extension of the 
document processing community where cameras are 
used to image hardcopy documents or natural scenes 
containing textual content [12]. 

Cameras in an uncontrolled environment 
have triggered a lot of interest in the research 
community over the last few years and many 
approaches have been proposed. However, there has 
been no satisfactory work presented for de wrapping 
techniques so far. Wrapping is a common 
appearance in camera captured document images 
[13]. It is the primary factor that makes such kind of 
document images hard to be recognized. Therefore 
it is necessary to restore wrapped document image 
before recognition. The documents captured from 
cameras often suffer from various distortions, like 
non-planar (wrapped) shape, uneven light shading, 
motion blur, perspective distortion, under-exposure 
and over-exposure. But current Optical Character 
Recognition (OCR) systems do not deal with these 
distortions when applied directly to wrapped 
camera-captured document images. 

Images when captured will suffer from 
distortions such as noise, blur and so on. In order to 
perform operations on document the distortions 
have to be removed. Noise removal and blur 
removal is done using filters. There are several 
types of filters available among them the Gaussian 
filter is the most efficient filter. Gaussian filters are 
a class of linear smoothing filters with the weights 
chosen according to the shape of the Gaussian 
function. The Gaussian smoothing filter is a very 
good filter for removing the noise drawn from a 
normal distribution. Gaussian functions are 
rotationally symmetric in two dimensions i.e. the 
amount of smoothing performed by the filter is the 
same in all directions. In image sharpening the goal 
is to highlight fine details in an image. That is, to 
enhance details that have been blurred. Fine details 
in the frequency domain correspond to high 
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frequencies, thus the use of high-pass filters for 
image sharpening [3] [10]. 

Text detection refers to the 
determination of the presence of text in a given 
frame (normally text detection is used for a 
sequence of images). Text localization is the 
process of determining the location of text in 
the image and generating bounding boxes 
around the text [2] [7]. Text tracking is 
performed to reduce the processing time for 
text localization and to maintain the integrity 
of position across adjacent frames. Although 
the precise location of text in an image can be 
indicated by bounding boxes, the text still 
needs to be segmented from the background to 
facilitate its recognition. This means that the 
extracted text image has to be converted to a 
binary image and enhanced image is then used 
for text extraction. Text extraction is the stage 
where the text components are segmented from 
the background, enhancement of the extracted 
text components is required because the text 
region usually has a low -re solution and is 
prone to noise. 

II. Literature Survey 

Jian Liang, et.al. proposed a method that is 
focused on analyzing text and documents 
captured by a camera which is known as 
camera-based analysis of text and documents. 
Camera based document analysis is more 
flexible to provide capability to capture 
information for visual communication, 
indexing, reading graphical text in web pages. 
In camera based analysis of text and 
documents, sources of images used are paper 
based, printed handwritten documents, journal 
etc. Scanner based process provides good 
reference and starting point, but they cannot be 
used directly on camera-captured images. 

Content in an image can be perceptual 



or semantic content but the text within an 
image is of more interest as it describes the 
contents of the image. It can be easily 
extracted compared to the semantic contents. A 
variety of approaches to Text Information 
Extraction(TIE) from images and videos have 
been proposed for specific applications 
including page segmentation, address block 
location, number plate location and content 
based image or video indexing. Text extraction 
system has various applications such as 
portable computers, content based 
video/document coding, license plate 
recognition and video content analysis. To 
enhance performance of text information 
system it is advantageous to merge various 
sources as proposed by Keechul et.al. 

Portable digital cameras are now used 
for digitalizing documents and as a fast way to 
acquire document images taking advantage of 
their low weight, portability, low cost, small 
dimensions etc. Several specific problems arise 
in this digitization process. Rafael et.al, 
addressed the inherent problems of document 
image digitization using portable camera. 
Their work was based on an issue that 
documents make use of translucent paper in 
such a way that back-to-front interference was 
not observed. Also when a document image is 
taken from the camera the strobe flash causes 
an uneven illumination of the document. 
Marginal noise, not only drops the quality of 
the resulting image for CRT screen 
visualization, but also consumes space for 
storage and large amounts of toner for printing, 
which alters the segmentation algorithm of the 
optical character recognition and thus affects 
the response obtained in the number of 
characters and words correctly transcribed. It 
assumes that the background may be of any 
colour or texture, provided that there is a 
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colour difference of at least 32 levels between 
the image background and at least one of the 
RGB components of the most frequent colour 
of the document background (paper). Two 
different experiments were set to evaluate lens 
distortion. The first one is visual inspection by 
humans, while the second one is based on 
analyzing the effect of the compensation of 
lens distortion in optical character recognition. 

In monochromatic images no 
iconographic or artistic value saves storage 
space and bandwidth in network transmission. 
The binarized preprocessed documents 
lowered the number of character substitution 
and it has also raised the incidence of insertion 
errors due to spurious noise inserted in the 
image. This paper presents ways to 
significantly improve the visualization of 
images whenever displayed on screen of CRT 
or LCD or printed. 

Celine and Bernard presented the 
solution which is independent of scenes, 
colours, lighting and all various conditions. 
Their algorithm was based on multi-hypothesis 
text extraction. 

Yu Zhanga et.al. proposed an 
algorithm based on binary document images 
which considers the horizontal text that is 
mostly present in both Arabic and Chinese 
characters. Wrapped document images should 
have text lines with a main direction of 
horizon. Thus several pairs of key points when 
mapped using Thin Plate splines(TPS) will 
restore the original image based on an 
interpolation algorithm[5] [8] . 

Syed Saqib Bukhari et.al. used a 
novel dewrapping approach based on curled 
text-lines information, which was extracted 
using ridges based modified active contour 
model (coupled snakes). This dewrapping 
technique is less sensitive with different 



direction of curl and variable line spacing. The 
optical character recognition error rate from 
wrapped to dewrapped documents was reduced 
from 5.15% to 1.92% for the dataset collected. 
The approaches for document image 
dewrapping can be divided into two main 
categories based on the document capturing 
methodology: one in which specialized 
hardware arrangement like stereo camera is 
required for 3D shape reconstruction of 
wrapped document and the other approach in 
which dewrapping method is designed for 
image that is captured using single hand-held 
camera in an uncontrolled environment. 

Faisal Shafait presents an overview of 
the approaches based on evaluation measure 
and the dataset used. The methods used are 
continuous skeletal image representation for 
document image dewrapping, Segmentation 
based document image dewrapping, 
Coordinate Transform Model (CTM) and 
document rectification for book dewrapping. 
Dewrapping of documents captured from 
hand-held cameras has triggered a lot of 
interest and thus many approaches have been 
proposed to achieve that. However, there has 
been no comparative evaluation of different 
dewrapping techniques so far. A dataset of 102 
documents captured with a hand-held camera 
were created and made freely available online. 
A text-line, text-zone, and ASCII text ground- 
truth for the documents in this dataset were 
made. The results showed that the CTM 
presented by Wenxin Li et al. performed better 
than the other two methods, but the difference 
was not statistically significant. Overall, all 
participating methods worked well and the 
mean edit distance was less than 1% for each 
of them. 

Based on the literature survey our 
main aim is to recover document images. 
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Hence the proposed technique was applied on 
gray scale document images and is based on 
several distinct steps like an adaptive 
document image binarization, a text line and 
word detection, a draft binary image 
dewrapping based on word rotation and 
shifting and finally a complete restoration of 
the original gray scale wrapped image guided 
by the binary dewrapping. The problems 
encountered are background removal, skew 
often found in the image in relation to the 
photograph axes, as documents have no fixed 
mechanical support in the document image. 
A. DATA SAMPLE DESCRIPTION 
Text data present in images and video contain 
useful information for automatic annotation, 
indexing and structuring of images. Extraction of 



this information involves detection, localization, 
tracking, extraction, enhancement and recognition 
of the text from a given image. The data samples 
for text extraction were captured as image from a 
2Megapixel mobile camera with a resolution of 
(960 X 1280). The size of each image captured 
varies between (60 - 80KB). The variations of text 
present in the images captured will be due to 
variations in size, style, orientation and alignment 
followed with a low image contrast and complex 
background. 

The images that were captured are 
manually classified/ categorized into blur, 
clean and wrapped images as shown in Fig 1. 



Blur Images 


Clean Images 


Wrapped images 
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Figure 1: Blur Images, Clean Images and Wrapped images 



III. IMPLEMENTATION 

The process is divided into three main phases 
namely preprocessing phase, dewrapping phase 
and the text extraction phase. In preprocessing 
phase, the quality of an image is enhanced. In 
dewrapping phase, the wrapped document images 
(the images which are captured from the 
cylindrical objects surface) are dewrapped. In the 



text extraction phase, the text in the document 
image is detected, localized and finally extracted 
into editable form. 

1. Pre-processing Phase: The experimental 
setup for capturing an image containing text 
requires a 2Megapixel mobile camera which is 
placed at a standard distance. The measuring 
scale is used to measure the distance of an 
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image from the mobile camera. Further, the 
intensity of light is also varied. The images are 
captured in a controlled environment at the 
distances of 10cm, 13cm and 15cm 
respectively. 

1.1 Preprocessing with Different Image 
Documents: Preprocessing involves several 
steps which are presented in the following 
section. In preprocessing we tend to remove 
noise, blur operation and sharpening as shown 
in Figure 2. 

2. De wrapping Phase: De wrapping is a two-step 
approach at the first step, a coarse dewrapping is 
accomplished with the help of a transformation 
model that maps the projection of a curved surface 
to a 2D rectangular area. The projection of the 
curved surface is delimited by the two curved lines 
which fits the top and bottom text lines along with 
the two straight lines that fit to the left and right 



text boundaries. Towards the second step, a fine 
dewrapping is achieved based on words detected. 
All words pose as normalized guide lines by the 
lower and upper word baselines. 
3. Text extraction Phase: Text data present in 
images and video contains useful information for 
automatic annotation, indexing and structuring of 
images. Extraction of this information involves 
detection, localization, tracking, extraction, 
enhancement and recognition of the text from a 
given image. 

However variations of text due to differences in 
size, style, orientation and alignment as well as low 
image contrast and complex background make the 
problem of automatic text extraction extremely 
challenging. 



Pre-processing Phases 


Converting the input RGB 
image into grayscale image: 
I = rgb2gray(RGB) 


Converting gray image to 

binary image: 

BW = im2bw(I, level) 


Blur removal: 
h = fspecial(type) 
B = filter(A,H) 


Sharpening: 

h = fspecial('unsharp', 
alpha) 
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Figure 2: Block diagram of preprocessing 

V. Algorithm 

Stage 1: Convert the input RGB image into 
grayscale image: 

(The RGB values are normalized to a single 
gray scale value. Grayscale images are distinct 
from one-bit black-and-white images, which in the 
context of computer imaging are images with only 
two colours black, and white [bi-level images]). 



Stage 2: Convert grayscale image into binary 
image: 

Stage 2.1: BW = im2bw (I, level) 

The output image BW replaces all pixels 
in the input image with luminance greater than 
level with the value 1 (white) and replaces all other 
pixels with the value (black). 
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Stage 3: Blur removal: 

Stage 3.1: h = f special (type) 

(Creates a two-dimensional filter h of the 
specified type, f special returns has a correlation 
kernel, which is the appropriate form to use with 
imfilter.type is a string having one of the following 
values: average, disk, gaussian) 

Stage 3.2: B = imfilter (A, H) 
(Filters the multidimensional array A with the 
multidimensional filter H.) 
Stage 4: Sharpening: 

Stage 4. 1 : h = fspecial ('unsharp', alpha) 

(Returns a 3/3 unsharp contrast enhancement filter, 
fspecial creates the unsharp filter from the negative 
of the Laplacian filter with parameter alpha, where 
alpha controls the shape of the Laplacian and must 
be in the range (0.0 to 1.0). The default value for 
alpha is 0.2.) 
Stage 5: Dewrapping: 

Based on the background of the image, different 
approaches have been proposed for document 
image dewrapping. These approaches can be 
divided into two main categories based on the 
document capturing methodology: (i) approaches 
in which specialized hardware arrangement like 
stereo camera, is required for 3D shape 
reconstruction of wrapped document and (ii) 
approaches in which dewrapping method is 
designed for image that is captured by using single 
hand-held camera/Mobile in an uncontrolled 
environment. In our present work, we deal with the 
second approach using Mobile phones where 
documents are captured using mobile on curved 
surfaces [8]. 

VII. Special Cases 

We have also worked with certain typical 
special cases where recognition of character 
was not highly relevant to the one present in 
the given document image which is mainly due 



to the style in which characters are represented 
in the given document image as illustrated in 
figure 3. 



INPUT DOCUMENT 
IMAGE 



OUTPUT EDITABLE FORM 






File Edit Format View Help 



The letter T could not be 
matched with the template T , 



File Edit Format I 



Since the degree of wrapping is 
high, the letter 'T' could not be 
matched. 



File Edit Format View Help 






The digit '8' has high degree of 
wrapping. Thus 8 is recognized as 

'5'. 



Figure 3: Typical Special Cases 
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VIII. Experimental Results 

Typical results on the various operations of the 
proposed approach are as shown in Fig.4. 



1234^ 

A. Input Image: Fig 4.1 The input document image 
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B Preprocessing: 

Fig 4.2 Input image, Motion blurred image, Gray 

image, Sharpened image 
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B.\. Noise Removal using bwareaopen 
Fig 4 .3 Input image without noise 




B.2. Binary image: 
Fig 4.4 Gray image converted into Binary image 
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C. Text localization: 

Fig 4.5 Bounding box for the text in the document 

image 
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D. Text Extraction: 
Fig 4.6 Characters extracted individually 



I text - Notepad 



1°|E3| 



File Edit Format View Help 
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E. Editable document: 
Fig 4.7 Text converted into editable format 



Figure 4: Various Operations performed 

IX. Conclusion 

Image processing is a method that has wide 
applications in disciplines related to a 
researcher's preview. In the present work, we 
have taken up an interesting concept of 
document image analysis in a broader sense 
but we have restricted ourselves to text 
wrapping. 

In specific, document analysis plays a 
very important role in image processing since 
any information present that has to be 
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authenticated will be in a form of document 
only. Thus in the present problem, we have 
used a concept of template matching to 
evaluate the text present in an input image with 
database of characters that have been taken as 
a knowledge during the process of training. 

Image wrapping or dewrapping may 
be implemented using texture mapping by 
defining a correspondence between a uniform 
polygonal mesh and a wrapped mesh. The 
points of the wrapped mesh are assigned the 
corresponding texture coordinates of the 
uniform mesh and the mesh is texture mapped 
with the original image. Using this technique, 
simple transformations such as zoom, rotation 
or shearing can be efficiently implemented. 

This paper provides various tools to 
play with images and also opens up new 
avenues in the areas such as text extraction and 
detection. Typically the approach is focused to 
convert the input document image, that is text 
(which might be a text present in an image) 
into its gray image constituents. Before 
performing any operations the present paper 
takes into consideration the various pre- 
processing stages such as blur, cleaning and 
sharpening to cognize a knowledge base. The 
algorithm designed for the present work is able 
to identify the alphanumeric text into its 
relevant values i.e. its alphabets (capital A-Z & 
small a-z) and numerals (0-9) that can have the 
properties of skew also. 
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Abstract — In order to protect secret information from sensitive 
and various applications, secured authentication system should 
be incorporated; it should contain security and confidentiality. 
Even if it is assumed that the cryptographic primitives are 
perfect, the security goals may not be achieved: the system itself 
may have weaknesses that can be exploited by an attacker in 
network attacks. In this paper a Secured Authentication 
Protocol System using Images (SAPSI) is presented. It ensures 
confidentiality, and authentication using server and Image based 
authentication mechanism. 

Keywords- Confidentiality, Security, Server, Image-Based 
Authentication System, Authentication, 



I. 



Introduction 



A significant challenge in providing an effective network 
system defence mechanism is to detect the intrusions and 
implement counter-measures. Organizations who use Secured 
Authentication system tolerate no leakage at all. 
Cryptographic primitives are useful tools but security of the 
primitives does not guarantee security of the system. Usage of 
different level of security provides a security policy that 
allows the classification of data and users based on a system of 
hierarchical security levels combined with a system of non- 
hierarchical security categories. [1, 5, 6]. 

Cryptographic mechanisms are communication systems 
that rely upon cryptography to provide security services across 
distributed systems. Applications increasingly rely on 
encryption services provided by cryptographic systems to 
ensure confidentiality and authentication during secure 
transactions over the network. However the security provided 
by these encryption services might be undermined if the 
underlying security system has any flaws in the design or 
implementation. Weaknesses in security systems such as 
misuse of encryption, compromising the private encryption key 
etc., are yet to be addressed. [8]. 

Secured Authentication System is an application of a 
computer system to process information with different 
sensitivities (i.e. classification of information at different 
levels) to permit simultaneous access by users with different 
security clearance and to prevent users from obtaining access 
to information for which they lack authorization. Secured 
Authentication has two goals: first goal is to prevent 
unauthorized personnel from accessing information. Second 



goal is to prevent unauthorized personnel from declassifying 
information. The traditional view of secured authentication is 
one of ensuring that information at a high security 
classification cannot flow down to a lower security 
classification. [1, 3, 12]. 

In this paper, Secured Authentication Protocol System 
using Images is proposed. It overcomes the identified 
drawbacks of existing systems. The attacks on existing model 
embedded in encrypted sessions are detected as monitoring the 
processes taking part in the systems is integrated. The new 
system uses encryption mechanisms. Hence the inside 
information is protected and also the outside attacks are 
prevented. To establish this, a server with authentication 
mechanism is used. Types of attacks were proscribed in the 
proposed system are Brute force attack, Dictionary attack, 
Keyloggers, Shoulder Surfing, Man-In-The-Middle attack and 
Database Server Compromise attack. 

Brute force attack. The hacker can try two kinds of Brute 
force attacks on this system. One is re-using of images and 
another is without re-use of images. For a user, there will be a 
unique password of length 8 or above selected in SAPSI for 
the given session. Possible image patterns were dynamically 
changed on every session along with random numbers. By 
performing this attack in SAPSI hacker unable to break the 
password because it needs two processes. 

Dictionary attack. Dictionary attack is one of the most 
commonly used techniques to break a Password-based system. 
If same kind of sequences appeared in the network for a long 
time it can be guessed by the hacker. 

Keyloggers. Keylogger is a program, which captures the 
user's keystrokes and sends this information to the hacker. 
The natural protection for an authentication system from the 
keylogger is to have a one-time password (or Dynamic 
password). 

Shoulder Surfing. Shoulder surfing is looking over 
someone's shoulder when they enter a password or a PIN code. 
It is an effective way to get information in crowded places 
because it is relatively easy to stand next to someone and 
watch as they fill out a form, enter a PIN number at an ATM 
machine, or use a calling card at a public pay phone. Shoulder 
surfing can also be done at a distance with the aid of 
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binoculars or other vision-enhancing devices to know the 
password. 



Man-In- The-Middle Attack. A man in the middle attack is 
one in which the attacker intercepts messages in a public key 
exchange and then retransmits them, substituting his own 
public key for the requested one, so that the two original 
parties still appear to be communicating with each other. 

This strategy is implemented to protect information from 
unauthorized disclosure or modification and to provide 
mechanisms to authenticate users participating in the exchange 
of information. [7] . 

In section 2 related works are discussed with their 
drawbacks. 

Section 3 discusses the overview of Proposed Secured 
Authentication System with server and Authentication 
mechanism using images methodology. 

In section 4 implementation details related to the system are 
presented. Conclusion is given in section 5. 

II. RELATED WORK 

Enhanced authentication mechanism using multilevel 
security model (EAMMSM) is the system that belongs to and 
applies multilevel security. Any sensitive application it 
includes confidential and secret information which must be 
used effectively in complicated and authenticated procedures. 
Using five levels of authentication methods with a set of 
privileges assigned, each user has to surpass 50% of every 
level to get the privileges rights. [1]. 

During authentication the information was hacked from the 
network plane using network analyser tool. Leakage of 
information occurred in three levels while transmitting 
answers with username and multiple questions methods. 
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In Improving text password through persuasion (ITPTP), 
users entered their passwords with visibility. [2]. 

Users tend to choose their passwords in a simple manner by 
entering visibility method, which makes the hacker to know 
with shoulder-surfing process. 



An authentication method combining text and graphical 
passwords (AMCTGP), and users selecting their passwords 
using random numbers assigned to images, is given in [1 1]. 

Users selecting their passwords by clicking random 
numbers listed in the selection panel can be identified by a 
hacker using movie-clip camera phones. 

In Multiple password interference in text and click-based 
graphical passwords (MPITCGP), users select their passwords 
from the given image as pass points. [10]. 

Users' selecting their passwords from the given image is a 
hectic process. If any mismatch of pass points occurred the 
original user itself would be unable to get authentication even 
by knowing pass point selections. 

In Pass Pattern System (PPS): A Pattern-Based User 
Authentication Scheme, data hacked from database through 
database compromise server attack is represented. [7]. 

There are several attempts reported in literature about 
authentication schemes in lieu of the traditional Password- 
based system. Each attempt is successful in increasing the 
strength of the system against some of the known attacks. 

They are either computationally intensive or they require 
additional hardware/software in the infrastructure. In this 
section we review the current attempts, identify the gaps and 
emphasize the motivation for developing Secured 
Authentication Protocol System using Images. 
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Figure 1: Secured Authentication Protocol System using Images Flow Diagram 
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Motivation for Secured Authentication Protocol System 
using Images: We proposed secured authentication system is 
robust against attacks such as the brute force, shoulder surfing, 
social engineering, database server compromise attack and 
Man-In-The-Middle attacks. It incorporates the essence of 
Image-based authentication system. 
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The client has to enter the index numbers according to the 
selected images in an order given during registration. As per 
the selection made during registration, the client has to enter 
index numbers now as 29, 34 and 61. 



III. SECURED AUTHENTICATION PROTOCOL SYSTEM 
USING IMAGES 

This system involves the use of authentication mechanism 
and a server that minimizes the hacking by the attackers. It 
monitors the clock cycle process effectively. Two processes 
are involved in this system. They are a) Authentication using 
Images and b) Security Questions Authentication using server 
represented as flow diagram in Figure 1 . 

A. Authentication using Images 

This is a Image-based authentication system based on the 
premise that 'humans are good at identifying, remembering 
and recollecting graphical image patterns than text 
patterns '.[9]. 

In SAPSI the client gets authenticated in two levels. In the 
first level the client gets authenticated using username and 
password method with graphical image patterns. It is 
illustrated in Figure 2. 

For providing the password the client has to enter the index 
number provided at the images. While entering index 
numbers in the password area it will be hidden and bullet 
marks will be displayed. For example, if the client chooses 
images rose, white lion and lord shiva then the index numbers 
27, 44 and 17 should be entered in a selected order. While 
confirming password images index numbers were shuffled, so 
user has to re-enter the password by giving different index 
numbers according to the images chosen. Here both image 
patterns and index numbers are represented as dynamic 
arrangements in every login attempt. Due to this setup no one 
would be able to read or guess the mechanism involved. 

For every authentication the images were shuffled and 
index numbers were varied and shuffled. It is represented in 
Figure 3. 
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Each image will be mapped with a corresponding number 
which is stored in the Image-Map table. Instead of comparing 
the images, the mapped numbers are compared. It serves as 
user friendly for the end-user and machine friendly for the 
system by reducing the comparison time by using numbers 
rather than images. A mapping mechanism which validates 
the index numbers with hidden letters is represented in Table I. 




Figure 3: A sample shuffling mechanism of Secured 
Authentication Protocol System using Image Patters. 

The client can select the images on some sequences familiar 
to him/her. Due to shuffling mechanism, this method reduces 
the guess ability of the persons who are related to the clients. 
During entry of password, only bullets appear in the password 
area which avoids the shoulder surfing attacks. 

When sending random numbers in the network plane, it will 
be converted into a computed ascii value, so that Man-In-The- 
Middle attack is prohibited. 
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A Sample image-map mechanism for SAPSI 



Figure 2: A sample Secured Authentication Protocol System 
using Image Patterns 
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Using this mapping mechanism the shuffling process of 
images and index numbers are generated. The images are 
validated only by using the hidden characters and index 
numbers which reduce the time complexity of comparing the 
images. 

The image positions are generated using permutation 
sequences. Let A = {II, 12, 13}, this set can be arranged in 3! 
ways as, 

[II] [12] [13] 

[II] [13] [12] 

[12] [II] [13] 

[12] [13] [II] 

[13] [II] [12] 

[13] [12] [II] 
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Image Pattern setup there will be 64 n (if selection of images 



includes reuse of images) or 
different images of length n. 



P n (without reuse of images) 



Number of possible Image Patterns : 



r(N 2 ) n 



(N 2 )! 



(N-n)l 



Number of possible Image Patterns for the size of N x N 
matrix with re-use of images as passwords (N 2 ) n is illustrated 
in Table II and without re-use of images as passwords (N 2 )!/ 
(N 2 - n)l is represented in Table III. 

TABLE II 



For n images n! Sequences were generated and it will be 
used randomly for every attempt of registration or login. 

Security Potency of Secured Authentication Protocol 
System using Images: 

In general, several attacks are possible on an authentication 
system. For any authentication system, the hacker can attack 
at least at three places: they are server, client and the 
communication link. The attack on server includes Brute 
force attack, Dictionary attack and compromising the server as 
a whole. At the client, the possible attacks are key logging 
and shoulder surfing. Finally on the communication link, the 
possible attack is Man-In-The-Middle attack, which can be 
done using packet sniffers. [7]. 

In terms of the data being passed from the user to the server 
the data stored in the secured server is comparable with the 
classical Password-based authentication system. In both cases, 
user sends the username and a password. This will be 
compared with the registry in the database. But because of the 
dynamic nature of password selection system, SAP SI is more 
secure than ordinary password-based scheme to attacks such 
as Brute force, Dictionary attack, Keylogger, Shoulder surfing 
and Server database compromise attack. The best known 
solution for such attacks is to use cryptography protocols at 
the server or on the communication link. In this we analyse 
the impact of the four attacks mentioned here on SAPSI. 

On analysing Brute force attack - I in SAPSI, if the hacker 
wants to guess the password, the probability of success will be 
l/(64 4 ) = 5.96046E-08 (Since there are unlimited images, 64 
images are taken as sample). If the guess is wrong, probability 
of success will remain the same for the next guess. It is 
because the password will change with every attempt. 
Hence, 

The probability of success for every attempt = l/64 n 

The other way of doing Brute force attack - II is to try all 
combinations of positions. For example, if we consider a 8x8 



POSSIBLE RE-USE OF IMAGE PATTERNS 



Length of the Image Password - n 



Size of 

Matrix - 

N 


4 


6 


8 


10 


12 


4 


65536 


16777216 


4294967296 


1.0995E+12 


2.81475E+14 


6 


65536 


2.177E+09 


2.8211E+12 


3.6562E+15 


4.73838E+18 


8 


16777216 


6.872E+10 


2.8147E+14 


1.1529E+18 


4.72237E+21 


9 


43046721 


2.824E+11 


1.853E+15 


1.2158E+19 


7.97664E+22 


10 


100000000 


1E+12 


1E+16 


1E+20 


1E+24 


11 


214358881 


3.138E+12 


4.595E+16 


6.7275E+20 


9.84973E+24 


12 


429981696 


8.916E+12 


1.8488E+17 


3.8338E+21 


7.94968E+25 


13 


815730721 


2.33E+13 


6.6542E+17 


1.9005E+22 


5.42801 E+26 



TABLE III 
POSSIBLE IMAGE PATTERNS WITHOUT RE-USE OF IMAGES 

Length of the Image Password - n 



Size of 

Matrix 

-N 


4 


6 


8 


10 


12 


4 


43680 


5765760 


518918400 


2.9059E+10 


8.718E+11 


6 


1413720 


1402410240 


1.2201 E+12 


9.2239E+14 


5.996E+17 


8 


15249024 


53981544960 


1.7846E+14 


5.4967E+17 


1.573E+21 


9 


39929760 


2.33669E+11 


1.2969E+15 


6.8163E+18 


3.388E+22 


10 
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8.58278E+11 


7.5031E+15 


6.2816E+19 


5.032E+23 


11 


203889840 


2.76719E+12 


3.6278E+16 


4.5913E+20 


5.606E+24 


12 


412293024 


8.02322E+12 


1.5169E+17 


2.785E+21 


4.963E+25 


13 


787083024 


2.12985E+13 


5.6241 E+17 


1.4488E+22 


3.64E+26 



To break the system, the hacker on an average has to break 
(nx64 n )/2 images (with reuse) or (nx 64 P n )/2 (without reuse). 



Number of images that are to be broken : 



~n (N 2 ) n 



< 



nN 2 ! 



2(N -n)! 
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Number of images that are to be broken by evaluating the 
length of the password in Brute force attack - I method is 
depicted in Graph 1 and Brute force attack - II is represented 
in Graph 2. 



Bruteforce Attack - 1 
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Vol 8, No. 8, 2010 
Shoulder surfing can be done easily on the password system, 
just by seeing the keys that the user is typing. But to decode 
the password in SAPSI, the hacker has to see both the key 
sequence and Image patterns and do a mapping before user 
submits the page. So shoulder surfing is of little or no use in 
SAPSI as compared to a password-based system. 
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Graph 1: Number of Images that are to be broken with 
reuse of Images. 
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Bruteforce Attack - II 
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Graph 2: Number of Images that are to be broken without 
reuse of Images. 

N represents the size of the Image Patterns and n represents 
the length of the password. 

Analysing Dictionary attack in SAPSI, commonly used 
images with client guess sequences can be possible (if images 
and random numbers are static). However, here the image 
pattern changes randomly on every presentation or session; it 
approaches the behaviour of one-time pad. 

SAPSI, being a dynamic password system, is not vulnerable 
to keyloggers. Even if the hacker gets the password of the 
client of a SAPSI system, this password cannot be reused by 
the hacker to login to the system, because of the dynamic 
nature of the Image Pattern system. 



In the case of SAPSI, using Man-in-the-middle attack the 
attacker is not able to get original messages because the 
images and random numbers changed dynamically on every 
presentation or session. 

Comparing these attacks and it is represented in Graph 3 . 

The images used for password selection can be of any kind. 
Depending on the application it can be varied. For sample 
discussion nature images were used. For implementation 
characters, numbers and special characters were used as 
images. Two digit and three digit random numbers were used 
in implementation. In compact display applications two digit 
random numbers preferred and in large display applications 
three digit random numbers preferred to mystify hackers. 



COMPARISON OF ATTACKS IN AUTHENTICATION SYSTEMS 
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Graph 3: Comparison of Attacks in Authentication Systems 
using Existing and Proposed System. 

B. Security Questions Authentication 

In second level the client gets authenticated using security 
questions. A 10-digit number is issued to the client at the time 
of registration. The client has to answer three security 
questions and the results are encrypted with a 10-digit number. 
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A resultant factor is passed over the network plane for 
validation to the server. 

Encryption Process 

Three security questions queried (si, s2, and s3). 
Ascii value evaluated for two security questions. (al 
and a2) 
Bitwise operation is performed, 

- suml=(al & a2) | s3 
resultant factor (sum2) = suml id 
Ascii value of resultant factor (sum2) send to verifier. 

Verification Process 



Evaluation of time difference for Existing and Proposed Methods 

3 

2.5 

! 2 

: 1 - 5 

; 1 

: 0.5 

o 



-Existing 



400 600 
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Figure 5: Evaluation of time difference for existing and 
proposed methods. 



During Client registration a shared 10-digit key (id) 
and resultant factor (sum2) issued to server. 
Authentication process: achieved result (sum3) of 
client resultant factor (sum2). 
Authentication granted - a shared 10-digit key (id) 
generated. If not then authentication denied. 

The server decrypts the resultant factor and gets the 
registration number of the client. 

After passing the Authentication using Images level and 
Security questions authentication level, the client gets 
authenticated. 

IV. ANALYSIS AND IMPLEMENTATION 

In this new system all drawbacks of existing methods are 
overcome with new secured authentication protocol system 
using images. This system is implemented both in single 
client and multiple clients with server. 
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Only two levels are used for authentication with single 
server to authenticate clients. No repetitive methods are used 
in this proposed method which does not irritate the client. No 
leakage of information is possible in this new method which 
avoids the Man-In-The-Middle attack. In [1] leakage of 
information which occurred during sublevel transitions is 
avoided in this new system. When entering password no 
visibility is there which protects the shoulder surfing attacks 
from related persons. In [2, 11] given passwords are 
processed using shoulder surfing and if any person tries to 
hack the password using capture devices, which is protected in 
new system by giving passwords in a hidden manner. Even if 
any capturing devices are used to capture images, no one will 
get information due to hidden bullets in the password area. It 
is very difficult to remember the pass points in an image [10]. 
This difficulty is avoided in this new system by selecting the 
random numbers in the images. This avoids the confused 
remembrance of pixel positions instead of whole images. 

Both existing and proposed systems were implemented and 
the time difference is evaluated and it is represented in Figure 
5. The total image position sequences were generated for 9 
images are 362880 and the results were shown in Figure 6. 
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Figure 6: Generated Image Position Sequences for 9 images. 
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V. CONCLUSION 



In order to improve the confidence in security system, 
detecting intrusions in those systems plays a vital role, as the 
security system design is not always perfect. Our system 
overcomes the problem encountered in existing systems and 
ensures the confidentiality and authentication when sending a 
message. 
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Abstract — Voice over internet protocol (VoIP) is a technology 
that uses Internet to transmit voice digital information. The 
Session Initiation Protocol (SIP) and Real time Switching (RSW) 
are signaling protocols that emerged as a new VoIP which gained 
popularity among VoIP products. In literature, many 
comparative studies have been conducted to evaluate signaling 
protocols, but none of them addressed the targeted protocols. In 
this paper, we make a comparative evaluation and analysis for 
SIP and RSW using Mean Opinion Score rating (MOS). We 
found that RSW performs better than SIP under different 
networks in terms of (packet delays). 

Keywords- VoIP; MOS; InterAsterisk eXchange Protocol; Real 
Time Switching Control Criteria and Session Initiation protocol 



I. 



Introduction 



Nowadays, the use of distributed computer network systems 
has been increased in many areas of government, academia 
and industry. Video conferencing system is one of computer- 
based communication applications. The idea of video 
conferencing appeared for the first time in the 1920s [1], The 
task of video conferencing concentrates on individuals to be 
together in space and time, and makes groups more effective 
at their work by applying different services such as telephony 
service over IP networks that are known as IP telephony or 
Voice over IP. VoIP communication usually consists of two 
protocols: (i) Signaling protocols that are used to setup a voice 
conversation and manage voice sessions (ii) Media transfer 
protocols that are used for exchanging voice data traffic during 
one session lifetime. [2]. One of the most important functions 
in the VoIP infrastructure is signaling session. Signaling 
session should be by any VoIP protocol before transmitting 
any media type. Therefore, it allows various network 
components to communicate with each other to setup and to 
tear down calls [3]. Recently, there is strong focus on the 
development of scalable VoIP protocols, such as SIP, RSW 
and IAX. 

Large efforts have been done to study SIP. It was standardized 
in the IETF (Internet Engineering Task Force) RFC 2543 and 
further extended in RFC 3261. It is used for creating, 
modifying, and terminating sessions with one or more 
participants [5][6]. 

Session Initiation Protocol has many features: (i) the service of 
text-based which allows easy implementation in object 



oriented programming languages such as Java and Perl. These 
allow easy debugging, and most importantly make SIP flexible 
and extensible, (ii) Less signaling, (iii) transport-layer- 
protocol neutral (iv) parallel search [5] [7]. 

Real time Switching appeared in late 1993 as a control 
mechanism for multimedia conferencing. It was designed by 
Network Research Group (NRG) in school of computer 
science-University Science Malaysia (USM) .The goal of 
RSW Control Criteria is how to conduct a conference around a 
meeting table [8][9][10]. Moreover, RSW is used to handle 
two issues in multimedia conferencing. The first one is to 
handle the confusion generated while everyone tries to speak 
at the same time. The second issue is the tremendous amount 
of network traffic generated by all participating sites [6]. 

More recently, InterAsterisk exchange protocol has been 
emerged as new protocol that improved the voice quality. It 
has also many features such as simplicity, NAT-friendliness, 
efficiency and robustness [2][3]. There are several goals for 
this protocol. The main goals of this protocol are (i) 
Minimizing bandwidth usage for signaling and media transfer, 
with a particular emphasis on voice (ii) Ensuring NAT 
transparency (iii) Ability to exchange dial plans (iv) Efficient 
implementation of intercom and paging features. On the other 
hand, IAX can be used with different types of streaming media 
such as video and voice calls [4]. In this paper, we have 
selected two case studies that depend on IAX protocol. In the 
first case study, the researchers made comparison between 
IAX and RSW [3]. While in the second case study the 
researchers compared between IAX and SIP [2]. According to 
the last two comparisons, we have made a comparative 
evaluation of SIP and RSW by using Mean Opinion Score 
rating (MOS). 

II. Related Work 

Before transmitting any media, the signaling session, which is 
the most important function in VoIP, allows different network 
components to communicate between each other to set up and 
terminate calls. Mean Opinion Score (MOS) is used as a 
method to assess call quality. In addition, it is used to measure 
subjective voice quality. To measure network performance, we 
use the MOS rating which is the most widely used assessment 
technique. The listening subjects were given a scale from 1 to 
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5, where 1 = bad, 
excellent [11]. 



2 = poor, 3 = fair, 4 = good, and 5 



Recently, there are many researchers concentrated on this 
function and compared between different VoIP protocols. The 
study conducted by [2] compared the performance between 
IAX and SIP protocols. They have indicated that both 
protocols perform comparably in the presence of fixed delay. 
Therefore IAX appeared to perform slightly better as shown in 
Figure 1 [2]. 



VoIP Quality in Relation to Packet Delay (SIP versus IAX) 



(IJCSIS) International Journal of Computer Science and Information Security, 

Vol. 8, No. 8,2010 
that the packet delay in IAX is better than that in SIP and 
RSW. The gained results are extracted using the SPSS 
statistical tool. The Interclass Correlation Coefficient 
statistical test is used to analyze IAX with comparison to the 
previous two studies. As a result, the correlation between the 
two variables is high. The single measure Interclass 
correlation coefficient is 0.995, the test value for absolute 
agreement is 0.99999, and p is approximately equal to 1. The 
IAX protocol regression formula and curve were identical in 
previous two studies. An indirect comparison between RSW 
and SIP was made directly in terms of fixed packet delay. 
Figure 3 indicates that the RSW protocol performs slightly 
better in the presence of fixed packet delay than SIP protocol. 
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Figure 1: Packet Delay - SIP versus IAX [2] 
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The researchers in [3] compared the performance between 
IAX and RSW protocols. They indicated that both protocols 
perform comparably in the presence of fixed delay. Therefore 
IAX appeared to perform slightly better than RSW as shown 
in Figure 2 [3]. 
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-IAX 
- RSW 



LOLOLoiniOLOLOu-iu~iinLnLnLnLnLnLnLnLnLo 

M(M(MM(MMM(MM(M(M(M(M(M(M(M(M(M(M 

^HrMm*d-u"i^GiP"» D QCT , io^-icMm<d-(.£>r-cQcr 1 



Delay(ms) 



Figure 2: Packet Delay - RSW versus IAX [3] 



III. Analysis and Evaluation 

The predicated evaluation is based on previous two 
comparisons that were conducted by [2] and [3]. We can see 



Figure 3: Packet Delay - SIP versus RSW 



Conclusion 

This paper suited the performance of the RSW and SIP in term 
of packet delay. From the evaluation and analysis, we 
conclude that RSW achieves more improvement than SIP in 
VoIP in relation to packet delay via the MOS score. Further 
research on RSW control criteria and SIP protocols and their 
performance under various conditions is recommended. 
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Abstract- Information is worthwhile only if it can be accessed at the 
right time, by the right person & is useful for the purpose defined. 
Health care providers have a strong tradition of safeguarding 
private health information. Today's world belongs to Information 
Technology. With information broadly held and transmitted 
electronically, the rule provides clear standards for all parties 
regarding protection of personal health information. Medical 
resources integration concerns has also been a long-standing 
problem, which need to work in collaboration with information 
technology, aiming at a common goal. The complexity and 
extension of roles of the planning system demand extensive 
seamless integrations in the organization. Medical enterprise 
resources planning (ERP) integrates each level of healthcare staff 
by providing information and knowledge in timely manner, 
making the ERP a synchronizing solution for all the roles required 
in the organization for various timely decision makings. From this 
motivation, this paper proposes a role-oriented requirement 
definition analysis for ERP implementations in the organizations. 
Integrated hospitals need a central planning and control system to 
plan patients' processes and the required capacity. Given the 
changes in healthcare one can ask the question what type of 
information systems can best support these healthcare delivery 
organizations. We focus in this review on the potential of enterprise 
resource planning (ERP) systems for healthcare delivery 
organizations. 

Keywords: Patient Logistics, Planning and control, clinical 
management, ERP, AP. 

I. Introduction 
Information technologies in healthcare is more about 
how it originally developed for manufacturing are fitted to 
support clinical and administrative work in hospitals. Common 
denominators for these information technologies are Enterprise 
resource planning systems or simply ERP. As the healthcare 
sector faces increasing demands from political and public sides 
to document increased cost- effectiveness, to make optimal use 
of still more scarce resources and to improve the quality of 
patient care, a number of IT vendors Endeavour in a quest to 
help healthcare organizations to gain control over their business 
processes. My research is concerned with the issues that arise 
when an IT-solution, based on rationalistic models of 
Healthcare best practices meets clinical practice and the work 
that needs to be done in order to align the solution to praxis or 
sometimes vice versa. Enterprise resource planning (ERP) 
is a company-wide computer software system used to 
manage and coordinate all the resources, information, and 
functions of a business from shared data stores. An ERP 
system has a service-oriented architecture with modular hardware 
and software units or "services" that communicate on a local area 
network. The modular design allows a business to add or 
reconfigure modules while preserving data integrity in one shared 
database that may be centralized or distributed [6] [7]. 120 



II. Why ERP introduce in health care industry 
Healthcare is yet another booming industry that enjoys 
unlimited growth opportunities through the globe. The obvious 
reason is that science and technology leads to both vices and 
virtues. The diseases are rapidly multiplying at a rate faster than 
the invention of medicines. No doubt it has become a very 
lucrative profession be it medical professional or paramedical 
professional or corporate hospitals. Some of the reasons for the 
increased usage and preference of ERP are as follows [9] [10]: 

A. Computerization 

ERP function becomes very important in this context. 
The attitude of personnel in hospital industry is quiet different 
from that of the others. When people quickly respond to 
computerization and automation it is practically difficult for the 
trend to follow suit in hospitals. The reason is doctors are 
keener to update about medical terminologies and surgical 
trends than any about any other office equipment. This has also 
increased the percentage of ERP use. 

This could not last for a long time. The situation 
changed slowly. In addition to the other departments medical 
and paramedical professionals have taken the necessary steps to 
keep them abreast of the latest technology as it is directly or 
indirectly connected with the profession. ERP function helps to 
achieve this. 

The concept of enterprise resource planning has made 
the process of segregating bills and patient records much easier. 
This industry also realized the need to adapt to ERP. Hence the 
industry was able to save more lives (but at the cost of an 
individual's privacy) as it enabled to access the database of 
patients and medical histories through the common database 
shared by hospitals and that too at a quicker rate. 

B. Avenues for earning 

The scope of medicine gets enlarged at a devastating 
rate. ERP architecture is an important factor in this area. What 
was considered as service in the yesteryears has become a 
service industry in commercial jargon. As hospitals multiply at 
a fast pace there is a constant urge to induce professionalism 
and best practices in the industry. 

In this context patients demand more than the previous 
days. They naturally want the best return for the money. When 
the industry is developing at a great speed with the two 
elements namely business and service gets stirred in 
proportions. And in this situation no hospital can continue with 
the old age practices and technologies and still charges heavily. 
Therefore it becomes imperative for hospitals to adopt the latest 
trends in terms of technology and acumen in order to retain 
customer loyalty. There is no need to advocate hospitals to get 
the latest medical equipments or follow the medical practices. It 
happens naturally otherwise their survival will become a 
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question mark. However after repeated experiences and acid 
tests hospitals have become prune to investing in latest 
technology like ERP and at the same time justify their cost. 

C. Pressure from patients 

Hospitals were not few and far between in the olden 
days. Hence the patients yielded to the demands of hospitals 
every time even if it meant lesser quality of treatment for the 
charges because they had no other choice. This is evident from 
ERP systems definition. 

The rate of hospital expansion is alarming. They have 
mushroomed every where. Patients are no more at the mercy of 
doctors but it is vice versa. When a patient is not satisfied he 
will not visit the hospital anymore. ERP systems definition will 
prove this better. 

When it comes to treatment there should not be more 
than minor discrepancies among hospitals that are off at the 
same standards. One parameter for measuring the quality of 
treatment is the technology involved in offering it. This 
technology not only creates a level of comfort but also helps the 
patient to be confident that he is receiving the best treatment. 

Consumers themselves demand tools like ERP and 
strongly rate it when it comes to the question of embarking with 
the hospital environment. There can be no wonder in saying 
that they have proved to be an important factor for influencing 
the hospitals to go for ERP. ERP review software helps them to 
decide the appropriate software. 

D. Reduces operational costs 

Enterprise resource planning helps to bring down the 
cost of operations. Based on the requirements the hospitals can 
go for best of breed or enterprise applications. This is an 
important step that helps hospitals to decide the appropriate 
software. 

Hospitals can reduce their overheads through ERP as it 
helps to integrate all functions namely accounts , finance , 
human resources and bring them systems under one common 
database on the basis of ERP architecture. 

III. ERP in the Healthcare 

ERP present an organization with operational and 
technical advantages, as well as a set of tangible and intangible 
financial benefits. In addition, ERP can provide number of 
benefits to health care organization. Human recourse can 
benefit from an ERP implementation due to centralized 
scheduling. In addition, for billing, laboratory, pharmacy and 
patient records can help in the anticipation of internal 
workflow. 
The planning process of ERP in hospital environments: 

ERP Administrator Plus integrated new generation 
hospital management software which converges latest 
technology and your administrative process to manage work 
process within the hospital. This is designed for multi- specialty 
hospitals, to cover a wide range of Hospital administration and 
management processes. It is an integrated client server 
application which uses Microsoft technologies as Front End and 
Flexible back end (like Oracle, SQL, etc). 
Objective: 

To provide an integrated Solution for the Hospital, which 
helps in Efficient Management of the Hospital? [11] 

• Enhance Patient Care 

• Improve work efficiency 

• Improve Fiscal Control 

• Eliminate the chances of any Pilferage 



• Enable the Growth of the Hospital ERP Administrator 

• Administrator Plus streamlines and integrate the 
operation processes and information flow in the hospital 
to synergize the resources namely men, material, money 
and equipments through information. 

• This ERP facilitates hospital-wide Integrated 
Information System covering all functional areas like 
out & in Patients Billing & Management, Patient Beds, 
Visiting Consultants, Medical Stores, Pathology 
Laboratories, Radiology Laboratories, Imaging, 
Pharmacy, Manpower Management, Kitchen and 
Laundry Services etc. 

• It performs core hospital activities and increases 
customer service thereby augmenting the overall 
Hospital Image. ERP bridges the information gap across 
the hospital. 

• Administrator Plus eliminates the most of the business 
problems like Material shortages, Productivity 
enhancements, Customer service, Cash Management, 
Inventory problems, Quality problems, Prompt delivery, 
Pilferage, TPA Billing etc. 

Key Features 

• Administrator plus (AP) is a fully-integrated, Hospital 
Information System Solution. 

• AP gives a total integration of order entry systems, 
administrative system, and departmental subsystems 
within a hospital. 

• AP allows for scalability, reliability, efficient data 
processing, quick decision making, reduced maintenance 
and overheads. 

• The data is stored in a single database providing real 
time data across applications throughout the hospital. 
Since all the data is stored centrally, it can be viewed 
simultaneously from multiple terminals giving all 
departments' access to 

timely, up-to-date patient information. 

• Administrator Plus offers a foolproof data security 
without user intervention to archive data. 

• AP is a comprehensive information system dealing with 
all aspects of information processing in a hospital. This 
encompasses human (and paper-based) information 
processing as well as data processing machines. 

• As an area of Medical Informatics the aim of Admin 
Plus is to achieve the best possible support of patient 
care and administration by electronic data processing. 

Modules 

As fig 1 ERP AP devised the following modules according to 
the requirements of a Multi Specialty Hospital and they 
integrate the various departments into a comprehensive system. 



Meeting the Needs : ERP Solution 
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Figure 1: ERP Modules 
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The various modules of ERP Administrator Plus are [8]: 

a. Reception Management 

Reception is the first point of interaction for anybody 
coming to the Hospital. It has all the information of the 
patients, doctors, departments and activities of the Hospital. All 
enquiries and appointments are scheduled through this module. 
All information available here are in real time and any enquiry 
about the patient status, Room Status, Doctors availability or 
tariffs for various services is on actual status since the data is 
constantly updated. 
This module comprises of the following components 

• Patient Enquiry: This will provide information of any 
patient like: Patient status, Name, Address or any other 
demographic detail. 

• Consultants Enquiry: Any Information regarding a 
visiting Consultant can be obtained like consultant's 
availability, days & time of availability, Department, 
specialization or any other. 

• Tariff Enquiry: This option can be used to enquire about 
the tariffs of the hospital. The tariffs are classified into 
departments. Enquiry about any service provided in the 
Hospital can be made. 

• Appointments Scheduling: This option allocates the 
slots for various consultants. Any appointment can be 
booked either by phone or visit, enquired and cancelled. 

• In-patient Enquiry: Any enquiry can be made for any 
indoor patient in the Hospital. The enquiry can be made 
as per the Name, Address, Department, Bed, Ward, 
Patient Registration Number, etc. 

b. Patient Registration 

Every patient who visits the hospital has to get 
registered prior to getting any consultation, treatment or 
investigations done. Registration of patients involves accepting 
certain general and demographic information about the patient. 
The patient is allocated a unique Registration number and a 
Patient Identification number. The Patient ID will remain same 
for his all subsequent visits to the hospital whereas he will be 
allocated a new registration number on every visit. The 
consultation charges (if applicable) can also be collected for the 
OPD patients during registration and a receipt will be 
generated. 

c. Out Patient Management 

After registration an OPD Card is printed for the OPD 
patients, which list all his registration information. This card is 
used for the prescription writing by the consultant. An 
Admission form is printed with all the registration details for 
Indoor patients, which serves as the cover page of the patient 
file. 

After the registration the patient comes to the 
consultation chamber, where the consultant records his history, 
diagnose and prescribe medicines & investigation. 
The Consultant note down the following details on Patients 
OPD Card like Complaints, History, Diagnosis, Investigation, 
Medicines, Advice and Next Visit 

This information is then entered into the patient data 
by the consultant or the operator at the OPD Counter. It serves 
the purpose of tracing patient's visits history and also as a 
feedback for research & analysis. The prescription can also be 
scanned and saved. The scanned data can be entered later into 
various fields by the operator. 

d. OPD Billing 

For billing of any OPD service like Pathology Tests, 
or any imaging investigation, the patient moves to OPD billing 
counter. Here the services are charged as per the rates already 



defined for various categories/ penal/ time etc to the patient 
with his Patient ID. The Payment is collected for the service 
provided and a receipt is generated. 

This module works as an interface with the diagnostic 
modules. All services will be automatically entered into the 
respective modules wherever required like lab & Imaging 
reporting. 

e. Investigation Reporting 

In the routine functioning of a hospital, various types 
of investigations are carried out. Carrying out number of tests 
and making the results available promptly is very crucial for 
assessing the patient's medical status and deciding on the further 
course of action. 

Investigation requisition can be auto -generated 
(through OPD billing or IPD) or can also be generated here, 
depending on the system followed in the hospital. The tests 
parameters are pre defined with the interpretations & formulae 
wherever applicable. The test results are entered into the 
software manually or with equipment integration and a 
descriptive smart report is printed after verification and 
validation. 

f. Indoor Patients Management 

The Indoor patient module commences when the 
patient is being registered and allotted a bed in the ward. It 
deals with the complete treatment and services provided to the 
patient during his stay in the hospital. 

This module works at the nursing station. During his stay in the 
hospital, every patient is provided various services in terms of 
consultant's visits, investigations, procedures, medicines & 
consumable, room services, diet, etc. All these services are 
entered online to the patient record through nursing station. It 
also interacts with the Investigation module, Store, Pharmacy 
and sends the requisitions to these departments. This data serves 
as major input for the IPD billing. 
IPD Billing 

Indoor billing module has a supervisory role. The entries for 
billing are automatically transferred to the patient bill by the 
respective departments, which provide the service. The services 
are charged as per the category/panel/package applicable. 
Here the bill is compiled and the payment collected from time 
to time. Provisional and Final bills are generated which 
provides complete information about the Services availed, its 
Charges, Advance collected, appropriate Receipts, Refunds, 
Credit notes, Concession allowed, etc. 

Optional Modules 

• Hospital Store 

This module deals with the inventory of all Hospital 
Equipments, Materials, Consumables, and Medicines, 
Implants & Asset items in different departments of the 
hospital along with their purchase and supplier details. 
Requisitions for different items/equipment are sent to this 
store from different departments and accordingly the Central 
Store issues items/equipment to various departments and 
generate purchase orders for purchases. This also maintains 
records of purchases, stock, and supplier list, 
item/equipment/material master tables. 

The Store module ensures that there is a round the 
clock availability of a sufficient quantity of drugs and 
consumable material for the patients in a mode that neither 
hinders efficient clinical work, nor it becomes a threat to the 
survival of the Store. 
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• Pharmacy 

The Pharmacy Module deals with the Retail Sale of 
medicines to OPD patients and issue of medicines to the In- 
patients in the hospital. Its function includes, online drug 
prescription, inventory management and billing of drugs, 
consumables and sutures. This module is closely linked to 
the Billing Module and In-patient Module. All the drugs 
required by the patient can be indented from the various sub 
stores. 

• Financial Accounting 

A Financial accounting module is linked with hospital 
billing module. You get online accounting of all revenue 
generated along with expenses incurred. There is no need to 
enter the revenue entries as they are already fetched from the 
billing module. All relevant information for the staff 
salary/wages, consultant share, etc is available. 

• Payroll 

This module Keeps track of all staff member's 
attendance; there leave record and deductions. Generate 
salary slip and other related reports. 

• Canteen and Diet 

This module deals with the billing of all canteen items 
and issue to Inpatients or staff. This will work as OPD 
billing. The rates of all items will be defined as per the 
category (general/Inpatient/staff) and the items issued to 
them are billed respectively. Inpatients bill can be reflected 
in their hospital Bill. 

• MRD Management 

Patient's Medical record data is critical for the analysis 
and research purposes. This data includes patient history, 
observation, diagnosis and therapeutic conclusions along 
with the tests details and serves as a source of information 
for any analysis and research. The purpose for this module is 
to utilize the patient's medical information and use it for 
analysis thereby improving patient care [8]. 

• Online Reporting 

This module transfers all the diagnostic reports 
performed in the day to the hospital website where the 
patient can view or download the report using the password. 
This has special significance for telemedicine. 

• TPA Management 

Keep track of all patients from a particular penal, their 
authorization amount and payment status with full bill 
details. Special rates for any particular penal will be 
automatically calculated. 

• Imaging Records 

Image capture and analysis are vital to the use of 
Nuclear, CT scan, Laparoscope's, Endoscopes, Colposcopes, C- 
Arm & Ultrasound Systems in medicine as shown in fig 2 & 3. 



print quality from digital medical modalities with both color 
and grayscale outputs with normal DeskJet or Laser printers. 
With our software's outstanding, High Quality Images, Superb 
Color Reproduction, Live Recording & Web Based utility to 
take 2nd opinion. Thus it can be used for any medical / 
scientific application or radiology solutions with digital 
interface and for archiving or educational purposes. 



• Main Features 

• Can capture images indifferent formats. 

• Recording facility, to record procedures or undergoing 
operations. 

• Procedure/Operations done Live can be given on CD. 

• Workstation can be created, at one point, for different 
modalities. 

• Data base to re-analyze and to keep patient records 
updated. 

• Can create Editing, filtering of colors(R, G, B) on the 
images. 

• Deferent report formats / Default formats for each 
modality. 

• Comparison mode is available. 

• Exclusive help menu, with brief knowledge about the 
equipment & S/W. 

• with voice recording. 

• Clipping, Splitting and Merging of different clips 
(Video). 

• Test overlay on a recorded video to mark abnormalities 
to built presentations when you need it most. 




Figure 3: Image Record 2 

• Special Features 

• Software can be operated by Remote Key. 

• Software can be operated by Foot Switch. 

• Can E-mail, Report with images to take 2nd opinion. 

• OT Management 
Detailed breakup of operation charges, OT 

consumables, anesthetist charges, etc from a separate 
module. OT scheduling for the patients is also done. 

• Online MIS 
This is a unique tool in the hands of top management. 

This module shows a screen where all current activities in 
terms of revenue are displayed and this screen is refreshed 
automatically after every 10 seconds. Simply saying, by 
looking at this screen, top management can find out the 
collection of various departments in OPD & Indoor, 
Outstanding of General and panel patients, No. Of 
admissions, Discharge of the day and so on. 

IV. Conclusion 

ERP systems have become vital strategic tools in today's 

competitive business environment. The paper helps to 

demonstrate how ERP is useful in healthcare system, what are 
Digital Imaging Software dedicated for each Modality with 123me benefits of mis system? how ERp works when implemented 

Auto report generation utility is designed to achieve optimum 




Figure 2: Image Record 1 
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in hospitals so that more hospitals can adopt this technology for 
there betterment and advancement and also work with it. 
Ultimate Aim - Better Patient Care with Efficiency through 
ERP implementation. 
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Abstract- In this paper we will introduce an expert 
system for evaluation of online exam. We use fuzzy 
system for classifying students based on their usage 
data and the final marks obtained in their respective 
courses. We have used real data from nine Moodle 
courses with Mansoura University Pharmacy students 
and apply techniques on two hundred students. This 
expert system will be able to facilitate education and 
play the role to play the role of virtual intelligent 
teacher referring to student capabilities by following 
the feedback mechanisms and will evaluate the online 
exams and questions to measure the difficulty level of 
exams. 

The main components of this expert system are 
Inference Engine, Knowledge Acquisition Facility and 
Knowledge-base that construct back-end of the 
system. We realize the model by a fuzzy rule-based 
expert system with its inference engine that uses 
various inference methods for education. 
Keywords: Fuzzy rule base, Knowledge base, Inference 



engine 



I. INTRODUCTION 



Modern information management systems 
enable the recording and the management of data 
using sophisticated data models and a rich set of 
management tools. In the context of educational 
systems, the information typically includes details 
about learning material, the tasks and the objectives, 
the course information, the contact information, the 
teacher and the student profiles, and the information 
related to student assignments, the tests, the grades, 
and other records [1]. 

In this paper, we seek means to model the 
imprecision of information and simplify the access 
to information systems, in terms of fuzzy modeling. 
The paper is organized as follows. Section 2 
presents the knowledge base of expert system for 
student grades assessment Section 3 discuss the 
components of fuzzy rule based controller. Section 4 



describes the results of the assessment students and 
online exams. Section 5 shows the feedback of 
evaluating students and online exams. Section 6 
concludes the work and indicates future research 
directions. 

II. Knowledgebase expert system 

All needed data is acquired from a teacher and 
stored in a "Knowledge Based" which should be 
able to face student training and their up growing 
problem. [2] We use knowledge base expert system 
as follow to model knowledge domain "fig.l" 




Figure 1. Knowledge base expert system 

Global Data Base consists of student state variables, 
teaching state variables and exam state variables 

• Student state variable (xi): 
xl -Exercise grade for every student 
x2-Exam grade for every student 
x3- Interesting course 

x4- Student level 

• Teaching state variable (yi): 

yl- Difficulty level exam 

y2-Teaching content 

y 3 -Teaching method 

y4-Teaching schedule 

y5- Degree of course usages 

y6- Degree of creating motivation. 
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• Exam state variable (zi) 

zl- Exam average grade 

z2- Exam level 

Fuzzy logic was primarily designed to represent and 
reason with some particular form of knowledge. 
Fuzzy logic is powerful problem solving 
methodology with a myriad of applications in 
embedded control and information processing. 
Fuzzy systems are mathematically based systems 
that enable computers to deal with 
imprecise, ambiguous, or uncertain information and 
situations. 

Fuzzy set theory was proposed in 1965 by 
Zadeh to help computers reason with uncertain 
and ambiguous information. Zadeh proposed 
fuzzy technology as a means to model the 
uncertainty of natural language [3]. He 
reasoned that many difficult problems can be 
expressed much more easily in terms of 
linguistic variables. Linguistic variables are 
words and attributes which are used to describe 
certain aspects of the real world. One 
important feature of linguistic variables is the 
notion of their utility as an expression of data 
compression. Zadeh describes this as 
compression granulation. He argues that this is 
important because it is more general than use 
of discrete values. This point means that an 
agent using linguistic variables may be able to 
deal with more continuous and robust 
descriptions of reality and problem spaces. Our 
approach is to design a fuzzy rule base system 
to control training process. 

III. FUZZY RULE BASED 

This system is designed for evaluating and 
teaching the students so that the resulting control 
system will reliably and safely achieve high 
performance operation. 

A block diagram of fuzzy system is shown in 
"Fig. 2" Basically in fuzzy control system, there are 
four major stages to accomplish the control process: 
[4] 

• Fuzzy input and output variables & their 
fuzzy value 

• Fuzzy rule base 

• Fuzzy inference engine 

• Fuzzification and defuzzification modules 

Fuzzy Controller 
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A Fuzzy Inference Process 

A fuzzy system works similar to a 
conventional system: it accepts an input value, 
performs some calculations, and generates an output 
value. This process is called the Fuzzy Inference 
Process and works in three steps illustrated in 
"Fig.3" [5]: 

• Fuzzification where a crisp input is translated 
into a fuzzy value. 

• Rule Evaluation, where the fuzzy output truth 
values are computed, and 

• Defuzzification where the fuzzy output is 
translated to a crisp value. 



Crisp Input 



Fuzzification 



Fuzzy Input 



Rule Evaluation 



Fuzzy Output 



Defuzzification 



Crisp Output 



Figure2. Fuzzy System 



Figure3 . Fuzzy System Process 

B. Fuzzification 

Fuzzification where a crisp input is translated 
into a fuzzy value. 

The membership functions defined on the input 
variables are applied to their actual values to 
determining the degree of truth. 
For example for the fuzzification crisp inputs, xl 
and yl and determine the degree to which these 
inputs belong to each of the appropriate fuzzy sets 
(Figure 3). 

At first it gets inputs and then fuzzifies them. After 
fuzzification, make decision through fuzzy inference 
engine according to fuzzy rule based system. 

C. The Fuzzy Inference engine fuzzy rule based: 

This is an interface for fuzzifying the user- 
requested parameters of the test items. The fuzzified 
parameters, along with a set of fuzzy rules, are then 
sent to an expert system to perform the inference 
process. 
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D. Defuzzification 

Defuzzification is a process of converting output 
fuzzy variable into a unique number. 

Defuzzification process has the capability to reduce 
a fuzzy set into a crisp single-valued quantity or into 
a crisp set; to convert a fuzzy matrix into a crisp 
matrix; or to convert a fuzzy number into a crisp 
number. [6] 

IV. EXPRIMENTS AND RESULTS 

A Evaluating an online exam 

We consider two fuzzy input variables as exam 
average grade zl (Fig4.a) and difficulty level of 
exam yl (Fig4.b). And the output will be the exam 
level (z2). Membership function of zl, yl and z2 
should be as follows (0 < |u< 1).[7] 
Fig 4-a 



:.u 



R9: If zl is low and yl is low Then z2 is difficult 



TABLE 1. Fuzzy Rules for exam evaluation 



:.: 



CO 




10 20 10 40 50 60 70 80 90 100 

Fig 4-b 




:.o 



0J 



0.0 

10 20 30 40 50 60 70 80 90 100 

Figure4. Membership function of exam average 
grade (zl) and difficulty level (yl) 

Fuzzy rule base for evaluating exam is designed as 

follows: 

Rl: If zl is high and yl is high Then z2 is easy 

R2: If zl is high and yl is medium Then z2 is 

moderate 

R3: If zl is high and yl is low Then z2 is moderate 

R4: If zl is medium and yl is high Then z2 is easy 

R5: If zl is medium and yl is medium Then z2 is 

moderate 

R6: If zl is medium and yl is low Then z2 is 

difficult 

R7: If zl is low and yl is high Then z2 is moderate 

R8: If zl is low and yl is medium Then z2 is 

difficult 





Low 


Medium 


High 


Low 


Difficult 


Difficult 


Moderate 


Medium 


Difficult 


Moderate 


Easy 


high 


Moderate 


Moderate 


Easy 



For Exam "Cyptology" course: 

1- Exam average grade for all students (exam 
grade) is calculated as: 

(Sum of students grades)/no of students 

Exam average grade = 66% 

2- Exam difficulty level 

Following formula for calculating the Exam 
Difficulty: [8] 

, P 

fl ■- 

Where a denoted the item difficulty, P denoted 
the number of examinees that answered the item 

correctly, and n denoted the total number of 
examinees. Exam difficulty level = 62% 

First we can apply fuzzification where a crisp input 

is translated into a fuzzy value, 

By applying Triangle Membership Function for 

"Fig 4.a" 

uA(zl) = 60% uA(yl) = 20% 

uB(zl) = 20% uB(yl) = 80% 

By applying inference mechanism 

if uA(xl) = 60% and uA(y 1) = 20% then 

uA(z2) = 20% 

if uA(zl) = 60% and uB(yl) = 80% then 

uA(z2)= 60% 

if uB(xl) = 40% and uA(yl) = 20% then 

uA(z2) = 20% 

if uB(zl) = 40% and uB(yl) = 80% then 

uA(z2) = 40% 

Second: Rule Evaluation, where the fuzzy output 

truth values are computed. 

According to fuzzy based rule, we find 

Az2 = uA(z2) = 20% "Easy" Rule 1 
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Az2 = uB(z2) = 60% "Moderate" Rule 2 

Third: We will apply defuzzifi cation where the 

fuzzy output is translated to a crisp value[9] shown 

in (figure 5). 

The center of gravity is calculated as follow: 

C0G = Jq%AP)xdx _ 0.6*60+0,2*70 = 



J a [iA(x)dx 



0.2+0.6 




Final Exam Level By 
Centroid Defuzzificalion 



Final Exam Level 



Figure5. Defuzzification Result for Exam 
Evaluation 

Then the exam level is "Moderate". 

B. Evaluating students 

We consider two fuzzy input variables as exam 

grade x3 (figure 6. a) and difficulty level of exam yl 

(figure 6.b) and the output will be the student level 

(x4). Membership function of x3, yl and x4 should 

be as follows (0 < jlx< 1). 



Fig 6-a 

Moderate High 



Very High 




40 60 

Fig 6-b 

Moderate High 



Very High 




Figure 6. Membership function of student grade 
(xl) and difficulty level (yl) 



Fuzzy rule base for evaluating student is designed as 
follows: 

Rl: If xl is low and yl is low Then x4 is fail 

R2: If xl is low and yl is medium Then x4 is fail 

R3: If xl is low and yl is high Then x4 is pass 

R4: If xl is low and yl is very high Then x4 is pass 

R5: If xl is medium and y 1 is low Then x4 is fail 

R6: If xl is medium and yl is medium Then x4 is 
pass 

R7: If xl is medium and yl is high Then x4 is good 

R8: If xl is medium and yl is very high Then x4 is 
good 

R9: If xl is high and yl is low Then x4 is pass 

RIO: If xl is high and y 1 is medium Then x4 is pass 

Rl 1 If xl is high and y 1 is high Then x4 is good 

R12: If xl is high and yl is very high Then x4 is 
excellent 

R13: If xl is very high and yl is low Then x4 is 
pass 

R14: If xl is very high and yl is medium Then x4 is 
good 

R15: If xl is very high and yl is high Then x4 is 
excellent 

R16: If xl is very high and yl is very high Then x4 
is excellent 

TABLE 2. Fuzzy Rules for student evaluation 





Low 


Medium 


High 


Very High 


Low 


Fail 


Fail 


Pass 


Pass 


Mediu 
m 


Fail 


Pass 


Good 


Good 


high 


Pass 


Pass 


Good 


Excellent 


Very 
High 


Pass 


Good 


Excellent 


Excellent 



For student "student ID 1008": 

1- Exam grade for this student is : 

Student grade = 77% 
2-Exam difficulty level 
Exam difficulty level = 62% 

First we can apply fuzzification where a crisp input 
is translated into a fuzzy value, 
By applying Triangle Membership Function for Fig 
6-a 

uA(xl) = 10% uA (yl) = 85% 

uB(xl) = 90% uB (yl) = 15% 

By applying inference mechanism 
if uA(xl) = 10% and uA(yl) = 85% then 
uA(x4)=10% 
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if uA(xl) = 10% and uB(yl) = 15% then 

uA(x4) = 10% 

if uB(xl) = 90% and uA(yl) = 85% then 

uA(x4) = 85% 

if uB(xl) = 90% and \iB(yl) = 15% then 

uA(z2)=15% 

Second: Rule Evaluation, where the fuzzy output 
truth values are computed. 
According to fuzzy based rule, we can use 
Ax4 = uA(x4) = 10% "Excellent" Rule 16 
Ax4 = uB(x4) = 85% "Excellent" Rule 12 

Third: We will apply defuzzifi cation where the 

fuzzy output is translated to a crisp value. 

COG = 78 

Then the student level is "Excellent". 

V. FEEDBACK FOR EVALUATING 

SRUDENTS AND ONLINE EXAM 

We can classify exams according to our 
expert system in to 3 levels: Easy, Moderate and 
Difficult. [10] Then we have exam store for 
Pharmacy students for Mansoura University so we 
can evaluate this exams and give the feedback to the 
instructor to be a good reference for exam 
evaluation, so the results is as follow in Table [3] : 



TABLE3. Results for exam evaluation 



TABLE 4. Results for student evaluation 



Course 


Level 


Cartilage and Bone Online Exam 


Difficult 


Cytology for Clinical Pharmacy Exam 


Difficult 


Group I Online Exam of Immune 
System 


Easy 


CVS Online Exam 


Moderate 


Urinary, Male, and Female Online 
Exam 


Moderate 


Online Exam of Muscular Tissue 


Easy 


Modifications, Glands & CT Exam 


Difficult 


Med-Term Exam for Clinical 
Pharmacy 


Moderate 


Second Med-Term Exam for CP 


Difficult 



The student assessment is very important because a 
good assessment let the instructor to have a correct 
decision for student follow up. 

We classify student in to 4 levels: fail, pass, good 
and excellent. [11] So according to "Cyptology" 
Course the students level are as shown in Table 



Student ID 


Grade 


Level 


1006 


63% 


Good 


1007 


87% 


Excellent 


1008 


77% 


Excellent 


1009 


52% 


Pass 


1011 


38% 


Fail 


1014 


83% 


Excellent 


1052 


60% 


Good 


1062 


30% 


Fail 



CONCOLUSION 

Fuzzy expert system and fuzzy rule based is a great 
step forward for the adaptation of the accessible 
knowledge for the student according to the feedback 
obtain from the evaluating system. 
It's also considered a good reference for instructor to 
evaluate the exam level and the quality assurance 
organization is benefit from this evaluation. 
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Abstract — This paper focuses on the feasibility of Neural Network 
controller for Networked Control Systems. The Intelligent 
Controllers has been developed for controlling the speed of the 
Networked DC Motor by exploiting the features of Neural 
Networks and Fuzzy Logic Controllers. The major challenges in 
Networked Control Systems are the network induced delays and 
data packet losses in the closed loop. These challenges degrade 
the performance and destabilize the systems. The aim of the 
proposed Neural Network Controller and Fuzzy Logic Controller 
schemes improve the performance of the networked DC motor 
and also compare the results with the Zeigler-Nichols tuned 
Proportional-Integral-Derivative Controller. The performance of 
the proposed controllers has been verified through simulation 
using MATLAB/SIMULINK package. The effective results show 
that the performance of networked dc motor is improved by 
using Intelligent Controller than the other controllers. 

Keywords- Networked Control Systems (NCS); Network 
Challenges; Tuning; Proportional - Integral - Derivative 
Controllers (PID); Fuzzy Logic Controller (FLC); Artificial Neural 
Networks (ANN). 



I. 



Introduction 



Networked Control System is the adaptation of 
communication network for information exchange between 
controllers, sensors and actuators to realize a closed control 
loop. Networks reduce the complexity in wiring connections 
and the costs of Medias. They are easy to maintain and also 
enable remote data transfer and data exchanges among users. 
Because of these benefits, many industries and institutions has 
shown interest in applying different types of networks for their 
remote industrial control and automation. Regardless of the 
types of networks, the overall performance of NCS is affected 
by two major challenges as networked induced delay and data 
losses. The challenges of networked DC motor are generally 
controlled by Conventional Proportional - Integral - 
Derivative Controllers, since they are less expensive with 
inexpensive maintenance, designed easily, and very effective. 
But mathematical model of the controller and tuning of PID 
parameters are difficult and generally not used for non-linear 
systems. Hence to overcome these challenges auto-tuning and 
adaptive PID Controller was developed with few mathematical 
calculations. The Intelligent controllers as Fuzzy Logic 
Controller and Artificial Neural Networks were used to 
overcoming the challenges. Thus this paper proposes 
Intelligent Controller for the compensation of the challenges. 



The novelty of this paper lies in comparison of the application 
of NARMA-L2 Controller and Mamdani Fuzzy Logic 
Controller with conventional PID controller for the 
improvement of the performance of networked control DC 
motor. 

There are two approaches to utilize a data network as 
Hierarchical Structure and Direct Structure as shown in Fig. 1 
and Fig. 2 respectively. In the hierarchical structure the dc 
motor is controlled by its remote controller at remote station 
whereas in direct structure the central controller is used for 
controlling the speed of dc motor. Since the hierarchical 
structure has a poor interaction between central and remote 
unit, direct structure is preferred. 

Recently the stability analysis and control design for NCS 
have attracted considerable research interest [3], [4], [6] and 
[11]. The work of Nesic and Teel [2] presents an approach for 
stability analysis of NCS that decouples the scheduling 
protocol from properties of network free nominal closed-loop 
system. Nesic and Tabbara [3] extended [2] by stochastic 
deterministic protocols in the presence of random packet 
dropouts and inter transmission time and they also proposed 
wireless scheduling protocol for non-linear NCS in [6]. The 
networked predictive control scheme for forward and feedback 
channels having random network delay was proposed in [4], 
and [5] addresses the problems of how uncertain delays are 
smaller than one sampling period which affects the stability of 
the NCS and how these delays interact with maximum 
allowable transfer interval and the selected sampling period. 
Robust feedback controller design for NCS with uncertainty in 
the system model and the network induced delay has been 
addressed in [7] -[8], whereas [9] handles networked induction 
motor speed control by using linear matrix n equality (LMI) 
method. Ref. [1] measure the networked vehicle control 
performance using an H infinity norm with linear matrix 
inequalities conditions and markovian jumping parameters in 
communication losses. In case of time varying transmission 
times, model based NCSs has been proposed for stabilization 
problem of NCS. The stability analysis and controller 
synthesis problems are investigated in [11] for the NCSs with 
random packet losses by using H infinity control and linear 
matrix inequalities. A moving horizon method was developed 
by [12], which was applied as a quantized NCS in a practical 
context. Since these methods transmit data specifying only a 
region in which the measurements lie, it will reduce the 



131 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



(IJCSIS) International Journal of Computer Science and Information Security, 

Vol.8,No.8,2010 



network stabilization of the NCS. However, this method could 
reduce the stability of the control system by introducing 
uncertainty in the control system. The issues of limited 
bandwidth, time delay and data dropouts was taken into 
consideration when NCSs controllers were designed in [12] - 
[14]. The networked control system performance depends on 
the control algorithm and the network conditions. Several 
network conditions such as bandwidth, end-to-end delay, and 
packet loss rate are major impacts on networked control 
systems. Depending upon the control algorithm and network 
conditions the overall performance of the networked system 
may vary and hence the stability of the system. 



II. 



Modelling 



A networked control system can be divided into the 
remote unit, the central controller and the data network. Fig. 3 
shows the general block diagram of the networked control 
system under investigation. In order to focus our discussion on 
the performance of networked closed loop control system with 
network conditions (delay, data loss), a networked dc motor 
control system has been illustrated. 

A. Remote Unit 

The Remote Unit consists of the plant (dc motor), sensor 
and an interfacing unit. Via the network the remote unit can 
send measurements like motor speed, current, temperature, 
and local environment information, back to the central 
controller. The electro-mechanical dynamics of the dc motor 
can be described by the loop equation as first order differential 
equations. 

di 
u(t) = e a =L-± + Ri a +e b 

at (!) 




Figure 1. Hierarchical Structure. 



Control Signal 




Soect: measurement 



M:Motor 

S;Sonor 

T: Interfile 

CGCtutral Coa&oJlar 
C:R«oi0t<: Cootolltr 



Central 
Controller 




Remote Unit 
(DC Motor) 



Figure 3 . An overall real-time networked control system. 

where u=e a is the armature winding input voltage; e b =K b oo is 
the back-electromotive-force (EMF) voltage; L is the armature 
winding inductance; i a is the armature winding current; R is 
the armature winding resistance; K b is the back-EMF constant 
and cd is the rotor angular speed. Based on Newton's law the 
mechanical-torque balance equation is 

J— + Bo) + T l =Ki a 

dt ( 2 ) 

J is the system moment of inertia; B is the system damping 
coefficient; K is the torque constant and Ti is the load torque. 
By letting x x = i a and x 2 = co, the electromechanical dynamics 
of the dc motor can be described by the following state-space 
description: 

R K h 



xi(0 = 



1 

+—u 
L 



1 



• KB 

J l J 2 L 



(3) 



(4) 



The parameters of the motor Table 1 are used for determine 
the state space model of dc motor. 



TABLE 1 . DC Motor Parameters 



J 


Moment of Inertia 


42.6 e-6 Kg-m 2 


L 


Inductance 


170 e-3 H 


R 


Resistance 


4.67 n 


B 


Damping Coefficient 


47.8 e-6 Nm-sec/rad 


K 


Torque Constant 


14.7 e-3 Nm/A 


K b 


Back EMF constant 


14.7 e-3 Vsec/rad 



B. Central Controller 

The central controller will provide the control signal u c (t) 
to the remote systems. The central controller will monitor the 
network conditions of the remote unit link and provide 
appropriate control signals to each remote unit. Similarly the 
output responses are taken as feedback signal y R (t) to the 
central controller. The proposed Intelligent Controllers will 
compensate the network-induced delays, data losses and 
external disturbances. The data losses and disturbances occur 
due to missing or disturbances in input reference signal, control 
signal and feedback signal. 



Figure 2. Direct Structure. 
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C. Data Network 

There are different ways to define network conditions for 
point-to-point (from the central control to a specific remote 
unit). Two of the most popular network measures are the 
point-to-point network throughput and maximal delay bound 
of the largest data. One factor of interest is the sampling time. 

To keep the illustration simple, the remote unit receives the 
data sent from the central controller as u R (t), which can be 
mathematically expressed as 

U R (t) = U c (t-T R ) (5) 

where x R is the time delay to transmit the control signal u c (t) 
from the central controller to the remote unit. The remote unit 
also sends the sensors signals y R (t) of the remote system back 
to the central controller yc(t), and these two signals are related 
as 



yc( t ) = y R (t-T c ) 



(6) 



where x c is the time delay to transmit the measured signal 
from the remote unit to the central controller. There are also 
processing delays as x PC and x PR , at the central and remote unit, 
respectively which could be approximate small constants or 
even neglected because these delays are usually small 
compared to x c and x R . 

The functions of network variables such as the network 
throughput, the network management/policy used, the type 
and number of signals to be transmitted, the network protocol 
used, and the controller processing time, and the network 
traffic congestion condition are taken as the current network 
conditions n(t) and let z _t be a time delay operator which 
defines the signals as 

u R (t) = u c (z~ tR ,n(t)) 



(7) 

(8) 

In this paper, we have chosen sampling time as 0.5 ms and 
simulations are done. 



y c (t) = y R (z t %n(t)) 



III. Modelling Controller Design For NCS 

In this session the proposed Neural Network Controller and 
Fuzzy Logic Controller as the central controller is described 
and the results are compared with the PID controller. 

A. Neural Network Controller 

The proposed scheme utilizes the neural-network 
NARMA-L2 Controller. The Neural Network Controller is 
designed to take the error as the input and computes the output 
stabilizing signal depending on the input error signal. The 
block diagram of Neural Network Controller for NCS is 
shown in Fig. 4. 

The NARMA-L2 controller, a multilayer neural network 
has two steps involved as system identification stage and 
control design stage. In system identification stage, a neural 
network model of the plant which has to be controlled is 
developed and in later stage the neural network plant model 
has been designed to train the controller. The ANN plant 
specification has been shown in Table 2. 




Neural Network 
Controller 




Motor 
Speed 



Figure 4. Neural Network Controller for NCS. 
TABLE 2. ANN Plant Specification 



. of Inputs 


3 


. of Outputs 


2 


. of Hidden Layers 


2 


. of Training Samples 


1000 


. of Training Epochs 


200 



The error signals are trained for number of epochs by using 
the NARMA-L2 controller and the control signal are generated 
for any challenges in the network. 

B. Fuzzy Logic controller 

In general, fuzzy logic control is used for the control of a 
plant where the plant modeling is difficult. For such systems 
that are difficult to model, fuzzy logic controller has been 
successful by Mamdani. The basic principle of fuzzy logic lies 
in the definition of a set where any element can belong to a set 
with a certain degree of membership. Using this idea, the 
knowledge of an expert can be expressed in a relatively simple 
form and the inference for given inputs can be implemented 
very efficiently. Due to these advantages, fuzzy logic control 
is an attractive method for NCS whose modeling is very 
difficult because of the stochastic and discrete nature of the 
network. Fig. 5 shows the structure of FLC for a single input 
single output plant. In Fig. 5 r(t) is the reference input, y(t) is 
the plant output, e(t) is the error signal between the reference 
input and plant output and u c (t) is the control signal. 

The FLC consists of three parts as 1) Fuzzifier that 
converts the error signal into linguistic values, 2) Inference 
engine that creates the fuzzy output using fuzzy control rules 
generated from expert experience and 3) Defuzzifier that 
calculate the control input to the plant from the inferred 
results. The input and output signals to the FLC are error 
signal e(t) and control signal u c (t) respectively. In this paper, 
the trapezoidal fuzzy members are selected for membership 
functions. Three fuzzy linguistic variables, i.e., Small, 
Medium and Large are defined. The coefficients of the 
membership function depend upon the set point and are 
determined by several trial and error experiments with the 
plant without the network. In order for faster execution of the 
fuzzy logic controller, the Mamdani' s min-max inference 
method and the central average defuzzifier are used. 

The rules used in this paper are as 

If e(t) is small then u c (t) is small 
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If e(t) is medium then u c (t) is medium 
If e(t) is large then u c (t) is large 

C PID Controller 

It is used to compute the control signal to the remote dc 
motor for step tracking, based on the monitored system signals 
sent from the remote unit via the network link as in Fig. 6. The 
Proportional-Integral-Derivative (PID) controller used is 

de(t) 
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IV. Simulation Setup and Results 



U PID (0 = K p e(t) + Kj ^e{t)dt + K L 



dt 



(9) 



where K P is the proportional gain; K T is the integral gain; K D is 
the derivative gain; r(t) is the reference signal for the system to 
track; y(t) is the system output; and e(t) is the error function. 
In our case, y = oo is the motor speed, and U PID (t) is the input 
voltage to the motor system. 

The results of model system with ANN, FLC and PID 
Controllers for network induced delays, losses and disturbance 
are simulated and compared. 




In the simulation scenario, the direct structure of the 
networked DC motor control system is simulated using 
MATLAB/ SIMULINK under fully controlled environments 
for Neural Network Controller, Fuzzy Logic Controller and 
PID Controller. Equations (3) - (4) are used as the main 
model, and it is controlled by the controller with the insertions 
of network delays according to (5) - (6). The delays are varied 
according to different effects of interests. The disturbance and 
loss of input signal, control signal and the feedback signal 
were made for few milliseconds at each stage and the results 
were studied. The system setup is illustrated in Fig.4, Fig. 5 
and Fig. 6. Using (3)-(4) and Table 1, the state model of the dc 
motor is obtained. Then the results of the ANN and FLC are 
compared with the PID controller. 

Output Responses of the system are obtained for all 
controllers used in this paper. Fig. 7 shows the comparison of 
the system performance for all controllers without delays and 
data losses. 

Fig. 8-10 shows the response of the system for the 
controllers with different network induced delays and the 
comparison of these performances are tabulated in Table 3. The 
system responses with delay and data losses are obtained as in 
Fig. 11. From the simulation results as in Fig. 7 - 11, the 
overall system performance with Intelligent Controllers as 
ANN and FLC are better than the PID controller. 

Output Response 
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Figure 7. Comparison of System Responses for ANN, FLC and PID 
Controller without delay and losses. 



TABLE 3. Comparison of performance of the networked dc motor control system with delay in ANN, FLC and PID Controller. 

(Set point = 1500 rpm; Sampling Time = 0.5ms) 
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Figure 8. Response of the System using PID Controller with varying delays in forward and feedback path of NCS. 
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Figure 9. Response of the System using FLC with varying delays in forward and feedback path of NCS. 
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Figure 10. Output Response of the System using ANN with varying delays in forward and feedback path of NCS. 
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Figure 11. Comparison of system responses of ANN, FLC and PID Controllers with delay and losses. 



V. Conclusion 

Networks and their applications play a promising role for 
real-time high performance networked control in industrial 
applications. The major concerns are the network induced 
delays and data losses that are provided by the network which 
affects the performance of the networked control systems. This 
paper has describes and formulates the Intelligent Controllers 
as Neural Network Controller and Fuzzy Logic Controller in a 
networked DC motor control. The numerical result are obtained 
and compared for Neural Network Controller, Fuzzy Logic 
Controller and PID Controller. The effective results show that 
the performance of networked control DC motor is improved 
by using Intelligent Controller than the convention controller in 
all network variations and deteriorations. The analysis on using 
intelligent controls improves and strengthens the networked 
control systems concepts in the future. 
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A Novel LTCC Bandpass Filter for UWB 

Applications 
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Abstract — Bandpass filter based on parallel coupled line 
microstrip structure is designed in low -temperature co-fired 
ceramic technology (LTCC) suitable for short range Ultra- 
Wideband (UWB) applications. Fifth order Chebyshev filter of 
0.05 dB passband ripple with fractional bandwidth of 62.17% is 
proposed using insertion loss method. The filter demonstrates 
-10 dB bandwidth and linear phase response over the frequency 
range 3.8 GHz - 7.4 GHz. With the above functional features, the 
overall dimension of the filter is 33.5 mm (height) x 1.6 mm 
(length) x 1.6 mm (breadth). It is not only compact but also 
delivers excellent scattering parameters with the magnitude of 
insertion loss, IS 2 il lower than -0.09 dB and return loss better 
than -49 dB. In the passband, the computed group delay is well 
within the tolerable variation of 0.1 ns. 

Keywords- Ultra-wideband; bandpass filter; parallel coupled 
line; low -temperature co-fired ceramic; group delay 



gap size are required. Obviously, shrinking the gap size is not 
only the way to increase the coupling of coupled lines [10]. 

The proposed bandpass filter in this paper is based on 
LTCC using parallel coupling at center and broad side 
coupling at ends of the proposed filter structure. The filter is 
designed to cover the entire UWB range. The main advantage 
of the multi-layered structure is to shrink the circuit size. The 
obtained scattering parameters of UWB bandpass filter convey 
an optimal performance in terms of insertion and return loss. 
It is distinctive in its structure and it has simple design with 
less number of design parameters compared to the existing 
filter designs in the literature [11] -[13]. 

The rest of the paper is organized as follows: In Section II, 
the UWB bandpass filter design using LTCC is presented. 
Simulation results and analysis are presented in Section III. 
Section IV concludes the paper. 



I. Introduction 

UWB technology has brought out tremendously increasing 
research interests since the Federal Communications 
Commission (FCC) in USA released its unlicensed use for 
indoor and hand-held systems in 2002 [1]. Efforts have been 
made in the past eight years towards exploring various UWB 
components and devices. As one of the essential component 
blocks, the researchers are attempting to design the UWB 
bandpass filter (BPF) with 120% fractional bandwidth 
centered at 6.85 GHz. In the recent years, the market pays 
much attention towards miniaturization of receiver systems. 
Hence, researchers are working for the development of small 
size and cost effective filters [2]-[5]. 

Parallel coupled-line microstrip filters are found to be one 
of the most commonly used microwave filters in many 
practical wireless systems for several decades [6] -[8]. In 
addition to the planar structure and relatively wide bandwidth, 
the major advantage of this kind of filter is that its design 
procedure is quite simple. Based on insertion loss method [9], 
filter functions of maximally flat and Chebyshev type can be 
easily synthesized. Moreover, the filter performance can be 
improved in a straightforward manner by increasing the order 
of the filter. When these filters are to be realized by parallel 
coupled microstrip lines, one of the main limitations is the 
small gap size of first and last coupling stages. To increase the 
coupling efficiency, more fractional bandwidth and smaller 



II. Bandpass Filter Design 

Figure 1 shows one possible circuit arrangement for 
bandpass filter using parallel coupled line microstrip structure 
at center and broad side coupling at end of the geometry 
designed in LTCC for UWB range. It consists of transmission 
line sections having the length of half wavelength at the 
corresponding center frequency. Half wavelength line 
resonators are positioned so that adjacent resonators are 
parallel to each other along half of their length. This parallel 
arrangement gives relatively large coupling for the given 
spacing between the resonators, and thus, this filter structure is 
particularly convenient for constructing filters having larger 
bandwidth as compared to the other structures [14] -[17]. 




Figure 1. Geometry of the proposed UWB bandpass filter 
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The gap between the resonators is introducing a capacitive 
coupling, which can be represented by a series capacitance. 
The broad side coupling and existence of the substrate result 
tight coupling, which provides wide bandpass operation. The 
physical parameters of the proposed bandpass filter are 
optimized to the following values, li=5.89 mm; 1 2 =5.86 mm; 
d=0.2 mm; g = 0.08 mm; g\ = 0.1 mm; aO = 1.06 mm; 
ai = 1.2 mm; a 2 = 1.02; w = 1.6 mm; b= 1.6 and h= 33.5 mm 
to cover the entire UWB range between 2 GHz and 9 GHz. 
Using this configuration, higher coupling is obtained and 
therefore wider bandwidth is achieved. This structure is used 
to generate a wide passband and expected to achieve a tight 
coupling, and lower insertion by reducing both strip and slot 
width. 3D view of a LTCC UWB bandpass filter with parallel 
and broadside coupling is shown in Figure 2, which consists of 
two layers, resonators and substrate with frames. 
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Figure 2. 3D view of proposed LTCC filter structure 

III. Results and Discussion 

The proposed filter is designed to provide a wide passband, 
low insertion loss and return loss, linear phase over the 
passband, flat group delay and high fractional bandwidth. 
The simulation S parameters of the proposed UWB bandpass 
filter using LTCC are shown in Figure 3. It is clear from the 
response that the proposed filter has better insertion loss of - 
0.09 dB and the low return loss of about -49 dB. The -10 dB 
fractional bandwidth computed from the response is about 
62.17 %. 

For wideband applications, the examination of the flat group 
delay is essential and required. The simulation group delay for 
the proposed filter is shown in Figure 4, which exhibits a flat 
group delay response below 0.1 ns over the whole passband. It 
implies that this proposed UWB filter has a very good linearity 
of signal transfer and would ensure the minimum distortion to 
the input pulse when it is implemented in the UWB system. 
The response of the Figure 5 shows that the phase of S 2 i 
throughout the -10 dB passband between 3.8 GHz and 
7.4 GHz of designed filter is acceptably linear. 

In order to evaluate the performance of the proposed UWB 
bandpass filter, the filter is simulated through the simulation 
tool, IE3D [18]. The filter is designed based on LTCC 
substrate with two upper sheet layers, thickness of 1.6 mm and 
1.93 mm with dielectric constant of 7.8 and a loss tangent of 
0.002. 
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Conclusion 

In this letter, a bandpass filter for UWB applications 
based on LTCC structure is presented. The proposed filter 
demonstrated an excellent ultra- wide bandwidth from 3.8 GHz 
to 7.4 GHz. Total size of the UWB filter is 33.5 mm (height) x 
1.6 mm (length) x 1.6 mm (breadth) and the fractional 
bandwidth is about 62.17 %. Simulation of bandpass filter 
delivers excellent scattering parameters with magnitude of 
insertion loss, IS 2 il lower than -0.09 dB and return loss better 
than -49 dB. The obtained group delay for this filter is below 
0.1 ns. 
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Abstract — The histogram of Discrete Cosine Transform 
coefficients contains information on the compression parameters 
for JPEGs and previously JPEG compressed bitmaps. In this 
paper we extend the work in [1] to identify previously 
compressed bitmaps and estimate the quantization table that was 
used for compression, from the peaks of the histogram of DCT 
coefficients. This can help in establishing bitmap compression 
history which is particularly useful in applications like image 
authentication, JPEG artifact removal, and JPEG recompression 
with less distortion. Furthermore, the estimated table calculates 
distortion measures to classify the bitmap as genuine or forged. 
The method shows good average estimation accuracy of around 
92.88% against MLE and autocorrelation methods. In addition, 
because bitmaps do not experience data loss, detecting 
inconsistencies becomes easier. Detection performance resulted in 
an average false negative rate of 3.81% and 2.26% for two 
distortion measures, respectively. 



Keywords: Digital image forensic s; forgery detection; compression 
history; Quantization tables. 

I. Introduction 

Although JPEG images are the most widely used image 
format, sometimes images are saved in an uncompressed raster 
form (bmp, tiff), and in most situations, no knowledge of 
previous processing is available. Some applications are 
required to receive images as bitmaps with instructions for 
rendering at a particular size and without further information. 
The image may have been processed and perhaps compressed 
with contain severe compression artifacts. Hence, it is useful 
to determine the bitmap history; whether the image has ever 
been compressed using the JPEG standard and to know what 
quantization tables were used. Most of the artifact removal 
algorithms [2-9] require the knowledge of the quantization 
table to estimate the amount of distortion caused by 
quantization and avoid over-blurring. In other applications, 
knowing the quantization table can help in avoiding further 
distortion when recompressing the image. Some methods try 
to identify bitmap compression history using Maximum 
Likelihood Estimation (MLE) [10-11] or by modeling the 
distribution of quantized DCT coefficients, like the use of 
Benford's law [12], or modeling acquisition devices [13]. 

Furthermore, due to the nature of digital media and the 
advanced digital image processing techniques, digital images 
may be altered and redistributed very easily forming a rising 



threat in the public domain. Hence, ensuring that media 
content is credible and has not been altered is becoming an 
important issue governmental security and commercial 
applications. As a result, research is being conducted for 
developing authentication methods and tamper detection 
techniques. Usually JPEG compression introduces blocking 
artifacts and hence one of the standard passive approaches is 
to use inconsistencies in these blocking fingerprints as a 
reliable indicator of possible tampering [14]. These can also be 
used to determine what method of forgery was used. 

In this paper we are interested in the authenticity of the 
image. We extend the work in [1] to bitmaps and use the 
proposed method for identifying previously compressed 
bitmaps and estimating the quantization table that was used. 
The estimated table is then used to determine if the mage was 
forged or not by calculating distortion measures. 

In section 2 we study the histogram of DCT AC 
coefficients of bitmaps and show how it differs for previously 
JPEG compressed bitmaps. We then validate that without 
modeling rounding errors or calculating prior probabilities, 
quantization steps of previously compressed bitmaps can still 
be determined straightforward from the peaks of the 
approximated histograms of DCT coefficients. Results are 
discussed in section 3. Section 4 is for conclusions. 

II. Histogram of DCT Coefficients in Bitmaps 

We studied in [1] the histogram of quantized DCT 
coefficients and showed how it can be used to estimate 
quantization steps. Here, we study uncompressed images and 
validate that the approximated histogram of DCT coefficients 
can be used to determine compression history. Bitmap image 
means no data loss and hence all what is required to build an 
informative histogram is expected to be present in the 
coefficients histograms. 

The first step is to decide if the test image was previously 
compressed because if the image was an original 
uncompressed there is no compression data to extract. When 
the image is decided to have a compression history, the next 
step is to estimate that history. For grayscale image, 
compression history mainly means its quantization table which 
will be the focus of this paper. For color image, this is 
extended to estimating color plane compression parameters 
that includes subsampling and associated interpolation. 
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Fig. 1. Histograms of X*(3,3). 

Fig. 1(b) shows the approximated histogram H of DCT 
coefficient at position (3,3) of the luminance channel of an 
uncompressed Lena image and the histogram of the image 
after being JPEG compressed with quality factor 80. It is clear 
that the latter contains periodic patterns that are not present in 
the uncompressed version. It was observed that the coefficient 
is very likely to have been quantized with a step of this 
periodic [15]. Now if that JPEG was stored in a bitmap 
uncompressed form, we expect the DCT coefficients to have 
the same behavior because nothing is lost during this format 
change. This is evident in Fig. 1(d) which shows an identical 
histogram to the one in Fig. 1(c). Hence, similar to the 
argument in [1], if we closely observe the histogram of H*(i,j) 
outside the main lobe, we notice that the maximum peak 
occurs at a value that is equal to the quantization step used to 
quantize X q (i,j). This observation applies to most low 
frequency AC coefficients. Fig. 2(a) and (b) show \H\, the 
absolute histograms of DCT coefficients for Lena of Fig. 1(a) 
at frequencies (3,3) and (3,4), respectively. As for high 
frequencies, the maximum occurred at a value matching Q(i,j) 
when \X*(i,j)\>B, (Fig. 2 (c) and (d)), where B is as follows: 



r = 



X (iJ)-X (i,j 

q 

= ^J)5c(u)c(v) 



u,v 



B(i,j) 



(2ld+l )l7T ( 2v + 1 )J7T 

COS- — .COS 



16 



16 



(1) 



and c(co) 



where X q (i,j) is the quantized coefficient, and X (ij) is the 
approximated quantized coefficient, r is the round off error, 

IV V2 for cd = 

[l otherwise 

See [1,11]. 

Sometimes we do not have enough information to 
determine Q(i,j) for high frequencies (ij). This happens when 
the histogram outside the main lobe decays rapidly to zero 
showing no periodic structure. This reflects the small or zero 
value of the coefficient. At such cases, it can be useful to 
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Fig. 2. (a) IX*(3,3)I where H max occurs at 0(3,3)=6. (b) IX*(3,4)I where H max 
occurs at 0(3,4) = 10 (c) IX*(5,4)I where H max occurs at g(5,4)=22. (d) 
IX* (7,5)1 where H max occurs at Q(l, 5) = 41. 

estimate as many of the low frequencies and then search 
through lookup tables for a matching standard table. 

Estimating the quantization table of a bitmap can help 
determine part of its compression history. If all (or most of) of 
the low frequency steps were estimated to be ones, we can 
conclude that the image did not go through previous 
compression. High frequencies may bias because they have 
very low contribution and do not provide a good estimate. 
Moreover, this method works well also for uncompressed or 
lossless compressed tiff images. Fig. 3(d) shows the 96.7% 
correctly estimated Q table using the above method of a tiff 
image taken from UCID [16]. The X's mark the 
"undetermined" coefficients. 

Now for verifying the authenticity of the image, we use the 
same distortion measures we used in [1]. The average 
distortion measure is calculated as a function of the 
remainders of DCT coefficients with respect to the original Q 
matrix: 

8 8 
^1=XX mod(D(i, j), Q(i, j)) (2) 

i=l y=l 
where D(i,j) and Q(i,j) are the DCT coefficient and the 
corresponding quantization table entry at position (i,j), 
respectively. An image block having a large average distortion 
value indicates that it is very different from what it should be 
and is likely to belong to a forged image. Averaged over the 
entire image, this measure can be used for making a decision 
about authenticity of the image. 

In addition, the JPEG 8^8 "blocking effect" is somehow 
still present in the uncompressed version and hence blocking 
artifact measure, BAM [14], can be used to give an estimate of 
the distortion of the image. It is computed from the Q table as: 



*2(*)=££ 
i=iji=i 



D(iJ) -Q(iJ) round] ^^~ 



(3) 



where B(n) is the estimated blocking artifact for the n block. 
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(c) Estimated Q for previously compressed version with QF = 80. (d) Difference between (c) and original table for QF=80. 

Fig. 3. Estimating Q table for original and previously compressed tif image. 



III. Experimental Re sutls And Discussion 

A. Estimation Accuracy 

Our testing image set consisted of 550 images collected 
from different sources (more than five camera models), in 
addition to some from the public domain Uncompressed Color 
Image Database (UCID), which provides a benchmark for 
image processing analysis [16]. Each of these images was 
compressed with different quality factors, [60, 70, 80, and 90]. 
Again, each of these was uncompressed and resaved as 
bitmap. This yielded 550x4 = 2,200 untouched images. For 
each quality factor group, an image's histogram of DCT 
coefficients at one certain frequency was generated and used 
to determine the corresponding quantization step at that 
frequency according to section 2. This was repeated for all the 
64 histograms of DCT coefficients. The resulting quantization 
table was compared to the quality factor's known table and the 
percentage of correctly estimated coefficients was recorded. 
Also, the estimated table was used in equations (2) and (3) to 
determine the image's average distortion and blocking artifact 
measures, respectively. These values were recorded and used 
later to set a threshold value for distinguishing forgeries from 
untouched images. 

Table 1 shows the accuracy of estimating all 64 entries 
using the proposed method for each quality factor averaged 
over the whole set. It exhibits a similar behavior to JPEG 
images; as quality factor increases, estimation accuracy 
increases steadily with an expected drop for quality factors 
higher than 90 as the periodic structure becomes less 
prominent and the bumps are no longer separate enough . 
Overall, we can see that the estimation accuracy is higher than 



TABLE I. 



Percentage of correctly estimated coefficients 
for severla qfs 



QF 


60 


70 


80 


90 


BMP 


82.07% 


84.80% 


87.44% 


89.44% 


JPEG[1] 


72.03% 


76.99% 


82.36% 


88.26% 



that of JPEG images [1]. We anticipate that because lossy 
compression tends to lessen available data to make a better 
estimate. Average estimation time for all 64 entries of images 
of size 640x480 for different QFs was 52.7 seconds. 

Estimating Q using MLE methods [10-11] is based on 
searching for all possible Q(iJ) for each DCT coefficient over 
the whole image which can be computationally exhaustive for 
large size files. Another method [12] proposed a logarithmic 
law and argued that the distribution of the first digit of DCT 
coefficients follows that generalized Benford's law. The 
method is based on re-compressing the test image with several 
quality factors and fitting the distribution of DCT coefficients 
of each version to the proposed law. The QF of the version 
having the least fitting artifact is chosen and its corresponding 
Q table is the desired one. Of course the above methods can 
only estimate standard compression tables. Although it may be 
accurate, it is time consuming. Plus it fails when the re- 
compression quantization step is an integer multiple of the 
original compression step size. Another method [17] tends to 
calculate the autocorrelation function of the histogram of DCT 
coefficients. The displacement corresponding to the peak 
closest to the peak at zero is the value of Q(i,j) given that the 
peak is higher than the mean value of the autocorrelation 
function. The method eventually uses a hybrid approach; the 
low frequency coefficients are determined directly from the 
autocorrelation function, while the higher-frequency ones are 
estimated by matching the estimated part to standard JPEG 
tables scaled by a factor of s, which is determined from the 
known coefficients. 

Table 2 shows the estimation accuracy while Table 3 
shows estimation time, for the different mentioned methods 
against ours. Note that accuracy was calculated for directly 
estimating only the first nine AC coefficients without 
matching. This is due to the methods failing to estimate high 
frequency coefficients as most of them are quantized to zero. 
On the other hand, the listed time is for estimating the nine 
coefficients and then retrieving the whole matching table from 
JPEG standard lookup tables. Maximum peak is faster than 
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TABLE II. 



Estimation accurcay for the firsy 3x3 AC 

COEFFICIENTS FOR SEVERAL QFS 



QF 

Method 


50 


60 


70 


80 


90 


100 


Avg. 
Ace. 


MLE 


75.31 


83.10 


90.31 


96.34 


93.83 


59.5 


83.06 


Benford 


99.08 


87.59 


80.82 


93.81 


59.47 


31.53 


75.38 


Auto. 


48.94 


50.37 


63.71 


81.43 


65.37 


57.50 


61.22 


Max.Peak 


97.93 


97.07 


99.01 


97.67 


89.57 


76.04 


92.88 



TABLE III. 



Estimation Time in seconds for the firsy 3x3 AC 

COEFFICIENTS FOR SEVERAL QFS 



QF 
Method 


50 


60 


70 


80 


90 


100 


MLE 


38.73 


37.33 


37.44 


37.36 


37.32 


34.14 


Benford 


59.95 


58.67 


58.70 


58.72 


58.38 


80.04 


Auto 


9.23 


11.11 


11.10 


11.12 


11.24 


8.96 


Max.Peak 


11.27 


11.29 


11.30 


11.30 


11.30 


11.56 



statistical modeling and nearly as fast as that autocorrelation 
method. However, average accuracy of our method is far 
higher. MLE is reliable with 83% accuracy but with more than 
double the time. Benford' s law based method has an accuracy 
of 75 % but is the worst in time because recompressing the 
image and calculating distributions for each compressed 
version may become time consuming for larger images. 
Images used in the experiments were of size 640x480. 

B. Forfery Detection 

From the untouched previously compressed bitmap image 
set, we selected 500 images for each quality factor, each of 
which was subjected to four common forgeries; cropping, 
rotation, composition, and brightness changes. Cropping 
forgeries were done by deleting some columns and rows from 
the original image. An image was rotated by 270° for rotation 
forgeries Copy-paste forgeries were done by randomly 
copying a block of pixels from an arbitrary image and placing 
it in the original image. Random values were added to every 
pixel of the image to simulate brightness change. The resulting 
fake images were then stored in their uncompressed form for a 
total of (500x4) x 4 = 8,000 images. Next, the quantization 
table for each of these images was estimated as above and 
used to calculate the image's average distortion, (2), and the 
blocking artifact, (3), measures, respectively. 

Fig. 4(a) and (b) show values of the average distortion 
measure and blocking artifact measure, respectively. The 
scattered dots represent 500 untouched images (averaged for 
all quality factors for each image) while the cross marks 
represent 500 images from the forged dataset. As the figure 




Untouched Images 

Forged Images 









iKMmi 



mMmwmm 



wssmm 



>'! 



1 25» I 35» i 450 i 
Image Index 

(a) Average distortion measure 



I 9 f 150 1 1 I » I 450 500 
lip Index 

(b) Blocking artifact measure 



Fig. 4. Distortion measures for untouched and tampered images. 

shows, values from forged images tend to cluster higher than 
those from untampered images. We tested the distortion 
measure for untouched images against several threshold values 
and calculated the corresponding false positive rate FPR (the 
number of untouched images declared as tampered), An ideal 
case would be a threshold giving zero false positive. However, 
we had to take into account the false negatives (the number of 
tampered images declared as untampered) that may occur 
when testing for forgeries. Hence, we require a threshold value 
keeping both FPR and the FNR low. For average distortion 
measure, we selected a value that gave FPR of 10.8% and a 
lower FNR as possible for the different types of forgeries for 
average distortion. The horizontal line marks this threshold x = 
50. Similarly, we selected the BAM's threshold to be x = 40, 
with a corresponding FPR of 5.6%. Table 4 shows the false 
negative rate (FNR) for the different forgeries at different 
quality factors for bitmaps and JPEGs. As expected, as QF 
increases, a better estimate of the quantization matrix of the 
original untampered image is obtained, and as a result the 
error percentage decreases. Notice how the values drop than 
those for JPEG file. Notice also that detection of cropping is 
possible when the cropping process breaks the natural JPEG 
grid, that it, the removed rows or columns do not fall in line 
with the 8x8 blocking. Similarly, when the pasted part fails to 
fit perfectly into the original JPEG compressed image, the 
distortion metric exceeds the detection threshold, and a 
possible composite is declared. Fig. 5 shows examples of 
composites. The resulting distortion measures for each 
composite are shown in left panel. The dark parts denote low 
distortion whereas brighter parts indicate high distortion 
values. Notice the highest values correspond to the alien part 
and hence mark the forged area. 



TABLE IV. 



Forgery Detection Error Rates for Bitmaps and JPEGs 



Distortion Measure 


Original 


Cropping 


Rotation 


Compositing 


Brightness 


Average 


JPEG 


12.6% 


9.2% 


7.55% 


8.6% 


6.45% 


BMP 


10.8% 


3.9% 


4.45% 


2.0% 


4.9% 


BAM 


JPEG 


6.8% 


3.3% 


5.95% 


3.15% 


5.0% 


BMP 


5.6% 


1.05% 


3.05% 


1.25% 


3.7% 
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IV. Conclusions 



The method discussed in this paper is based on using the 
approximated histogram of DCT coefficients of bitmaps for 
extracting the image's compression history; its quantization 
table. Also the extracted table is used to expose image 
forgeries. The method proved to have practically high 
estimation accuracy when tested on a large set of image from 
different sources compared to other statistical approaches. 
Moreover, estimation times proved to be faster than statistical 
methods while maintaining very good accuracy for lower 
frequencies. Experimental results also showed that 

performance for bitmaps surpasses that of JPEGs because of 
their lossy nature but on the other hand, it takes more time to 
process a bitmap. 
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(a) Three composite bitmap images. 
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(b) Distortion measure for the three images in (a). 

Fig. 5. Distortion measures for some composite bitmap images. The left panel represents the average distortion measure while the right panel represents the 
blocking artifact measure. 
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Abstract — In this work, We study how we used the error- 
correcting codes in steganographic protocols (sometimes also 
called the "matrix encoding"), which uses a linear code as an 
ingredient. Among all codes of a fixed block length and fixed 
dimension (and thus of a fixed information rate), an optimal 
code is one that makes most of the maximum length embeddable 
(MLE). the steganographic protocols are close in error-correcting 
codes. We will clarify this link, which will lead us to a bound on 
the maximum capacity that can have a steganographic protocol 
and give us a way to build up optimal steganographic protocols 

Keywords: Steganography, Error-correcting code, average 
distortion, matrix encoding, embeding efficient. 

I. Introduction 

The goal of digital steganography is to modify a digital 
object (cover) to encode and conceal a sequence of bits 
(message) to facilitate covert communication. This process 
is fundamentally different from cryptography, where the 
existence of a secret message may be suspected by anyone 
who can observe the scrambled ciphertext while it is 
communicated. 

A common technique in steganography is to embed the 
hidden message into a larger cover object (such as a digital 
image, for example) by slightly distorting the cover object in 
a way that on one hand makes it possible for the intended 
recipient to extract the hidden message, but on the other 
hand makes it very hard for everybody else to detect the 
distortion of the cover object (i.e., to detect the existence of 
the hidden message). The amount of noise that is naturally 
(inherently) present in the cover object determines the amount 
of distortion that can be introduced into the cover object 
before the distortion becomes detectable. 
An interesting steganographic method is known as matrix 
encoding, introduced by Crandall [2]. Matrix encoding 
requires the sender and the recipient to agree in advance 
on a parity check matrix H, and the secret message is then 
extracted by the recipient as the syndrome (with respect to H) 
of the received cover object. This method was made popular 
by Westfeld [1], who incorporated a specific implementation 
using Hamming codes in his F5 algorithm, which can embed t 
bits of message in 2 t — 1 cover symbols by changing, at most, 
one of them. Matrix encoding using linear codes (syndrome 
coding) is a general approach to improving embedding 
efficiency of steganographic schemes. The covering radius of 
the code corresponds to the maximal number of embedding 
changes needed to embed any message [8]. Steganographers, 



however, are more interested in the average number of 
embedding changes rather than the worst case. In fact, the 
concept of embedding efficiency-the average number of bits 
embedded per embedding change-has been frequently used 
in steganography to compare and evaluate performance of 
steganographic schemes. 

There are two parameters which help to evaluate the 
performance of a steganographic protocol [n,k,p] over a 
cover message of N symbols : the average distortion D = ^ , 
where R a is the expected number of changes over uniformly 
distributed messages; and the embedding rate E = -^, which 
is the amount of bits that can be hidden in a cover message. 
In general, for the same embedding rate a method is better 
when the average distortion is smaller. 

The remainder of the paper is organized as follows. Section 
2 gives a brief introduction to Error-correcting Codes. In 
Section 3 we introduces the relations between error-correcting 
codes and steganography. We construct our approach and re- 
port on experimental results in section 4. Section 5 gives a 
conclusion. 

II. Basics Of Coding Theory 

We now review some elementary concepts from coding 
theory that are relevant for our study. A good introductory 
text to this subject is, for example, [7]. Throughout the text, 
boldface symbols denote vectors or matrices. A binary code 
C is any subset of the space of all n-bit column vectors 
x = (xi,...,x n ) G {0, l} n . The vectors in C are called 
codewords. The set {0, l} n forms a linear vector space if 
we define the sum of two vectors and a multiplication of a 
vector by scalar using the usual arithmetics in the finite field 
GF{2) ; we will denote this field by F 2 . For any C,D C F£ 
and vector x, C + D = {y G Fg\y = c + d,c G C,d G D} ; 
x + C = {y G F£\y = x + c,c G C}. The Hamming weight 
w of a vector x is defined as the number of ones in x. The 
distance between two vectors x and y is the Hamming weight 
of their difference d(x, y) = w(x — y). For any x G C and a 
positive real number r, we denote as B(x,r) the ball with 
center x and radius r, B{x,r) = {y G F^^x^y) < r}. 
We also define the distance between x and set C C F% as 
d(x,C) = min ce cd(x,c). The covering radius R of C is 
defined as R = max xe F n d(x^C). The average distance to 
code C, defined as R a = 2~ n ^2 xeFn d(x, C), is the average 
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distance between a randomly selected vector from i 7 ^ and 
the code C. Clearly R a < R. 

Linear codes are codes for which C is a linear vector 
subspace of F£. If C has dimension k, we call C a linear 
code of length n and dimension k (and codimension n — k), 
or we say that C is an [n, k] code. Each linear code C of 
dimension k has a basis consisting of k vectors. Writing the 
basis vectors as rows of an k x n matrix G, we obtain a 
generator matrix of C. Each codeword can be written as a 
linear combination of rows from G. There are 2 k codewords 
in an [n, k] code. Given x, y G Fg, we define their dot product 
x.y = xiyi + X2y2 + ••• + ^n2M> all operations in GF{2). We 
say that x and y are orthogonal if x.y = 0. Given a code C, 
the dual code of C, denoted as C\ is the set of all vectors 
x that are orthogonal to all vectors in C. The dual code of a 
[n, k] code is a [n,n — k] code with an (n — k) x n generator 
matrix H with the property that 



Hx = o x <E C. 



(1) 



The matrix H is called the parity check matrix of C. For any 
x G F%, the vector 5 = Hx is called the syndrome of x. For 
each syndrome s G F 2 n_/c , the set C(s) = {x G F 2 n |i^x = 
5} is called a coset. Note that C(0) = C. Obviously, cosets 
associated with different syndromes are disjoint. Also, from 
elementary linear algebra we know that every coset can be 
written as C(s) = x + C, where x G C(s) arbitrary. Thus, 
there are 2 n ~ k disjoint cosets, each consisting of 2 k vectors. 
Any member of the coset C(s) with the smallest Hamming 
weight is called a coset leader and will be denoted as ex,(s). 

1) Lemma: Given a coset C(s), for any x G C(s), 
<i(x, C) = w(6l(s). Moreover, if d(x, C) = d(x, c ) for some 
c G C, the vector x — c is a coset leader. 

Proof : d(x,C) = min ce c w ( x — c ) = m i n yec(s) w (y) — 
w(ei,(s)). The second equality follows from the fact that if c 
goes through the code C, x — c goes through all members of 
the coset C(s). 

2) Lemma: If C is an [n, fc] code with a (n — fc) x n parity 
check matrix # and covering radius R, then any syndrome 
s G F^ _fc can be written as a sum of at most R columns 
of H and R is the smallest such number. Thus, we can also 
define the covering radius as the maximal weight of all coset 
leaders. 

Proof : Any x G F% belongs to exactly one coset C(s) and 
from Lemma 1 we know that d(x,C) = iu(e£,(s)). But the 
weight w(eL(s)) is the smallest number of columns in H that 
must be added to obtain s. 

III. Linear Codes for Steganography 

The behavior of a steganographic algorithm can be sketched 
in the following way: 

1) a cover-medium is processed to extract a sequence of 
symbols v, sometimes called cover-data; 

2) v is modified into s to embed the message ra; s is 
sometimes called the stego-data; 



3) modifications on s are translated on the cover-medium 
to obtain the stego-medium. 

Here, we assume that the detectability of the embedding 
increases with the number of symbols that must be changed 
to go from v to s ( [5] for some examples of this framework). 
Syndrome coding deals with this number of changes. The 
key idea is to use some syndrome computation to embed the 
message into the cover-data. In fact, such a scheme uses a 
linear code C, more precisely its cosets, to hide ra. A word s 
hides the message ra if s lies in a particular coset of C, related 
to ra. Since cosets are uniquely identified by the so-called 
syndromes, embedding/hiding consists exactly in searching s 
with syndrome ra, close enough to v. 

A. Matrix Encoding 

We first set up the notation and describe properly the matrix 
encoding framework and its inherent problems. Let v G F™ 
denote the cover-data and ra G F^ the message. We are 
looking for two mappings, embedding Emb and extraction 
Ext, such that 

V(>, ra) G F™ x F r q1 Ext(Emb(v, ra)) = ra. (2) 



VO, ra) G F™ x F r q , d(v, Emb(v, ra)) < T. (3) 

Equation (2) means that we want to recover the message in 
all cases; (3) means that we authorize the modification of at 
most T coordinates in the vector v. 

From Error Correcting Codes (Section 2), it is quite easy to 
show that the scheme defined by 



Emb(v, ra) = v + D(m — E(v)) 



Ext(y) = E(y) = y x R l 



(4) 



(5) 



D and E mean respectively the decoding function and the 
function of the syndrome, enables to embed messages of 
length r = n — k in a cover-data of length n, while modifying 
at most T = R elements of the cover-data. 
The parameter 13 ^ represents the embedding efficiency, that 
is, the number of embedded symbols per embedding changes. 
Linking symbols with bits is not simple, as naive solutions 
lead to bad results in terms of efficiency. For example, if 
elements of F q are viewed as blocks of L bits, modifying a 
symbol roughly leads to |f bit flips on average and L for the 
worst case. 

A problem raised by the matrix encoding, as presented 
above, is that any position in the cover-data v can be changed. 
In some cases, it is more reasonable to keep some coordinates 
unchanged because they would produce too big artifacts in the 
stego-data. 
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1) Example: We now give an example of a proto- 
col steganography constructed from a linear single-error- 
correcting code. This was also discussed, for example, in [3]. 
Start from the matrix 

/ 1 1 1 1 

H= 10 10 11 

\ 1 1 1 1 

whose entries are elements of F^. Extracting Scheme is defined 



Ext : Fl 



Fl 



Ext(x 1 ,x 2 ,X3,X4,X5,X6,x 7 ) = (j/i, y 2 , 2/3), where y 1 = 
xi + x 4 + x 5 + x 7 , 2/2 = 2?2 + £4 + a?6 + ^7 and y 3 = 
^3+^5 + ^6+^7- This function can be described in terms of 
matrix H.ln fact, yi, is the dot product of x and the z-th row 
of H. We claim that Ext is a extracting function of proto- 
col steganography (7, 3, 1) Embedding Scheme for example, 
£^(0,0,1,1,0,1,0) = (1,0,0). Assume y = (1,1,1). We 
claim that it is possible to replace x = (0, 0, 1, 1, 0, 1, 0) by 
x' such that Ext(x') = (1,1,1) and d(x,x') = 1. In fact, 
we claim more: the coordinate where x has to be changed is 
uniquely determined. In our case, this is coordinate number 
6, so x' = (0,0,1,1,0,0,0), Here is the general embedding 
rule: form Ext(x) + y, (in the example this is 011). Find the 
column of H which has these entries (in our example, this is 
the sixth column). This marks the coordinate where x needs to 
be changed to embed pay load y. This procedure indicates how 
H and Ext were constructed and how this can be generalized: 
the columns of H are simply all nonzero 3 -tuples in some 
order. In general, we start from our choice of n and write 
a matrix H whose columns consist of all nonzero n- tuples. 
Then H has N = 2 n — 1 columns. The extracting function, 



Ext 



F 2 N 



F% is defined by way of the dot products with 



iv. asymptoticly tight bound on the 
Performance of Embedding Schemes 

Our goal in this section is obtain steganographic proto- 
cols asymptotically optimal. Initially, the relationship updates 
in Section 3, between steganographic protocols and error- 
correcting codes, we used to translate the bounds on the error- 
correcting codes in bounds, upper and lower, of the maximum 
number of messages in a schema. The bounds on the error- 
correcting codes are known to be achieved by linear codes, we 
use this result to the next section to construct our protocols. 

2) Proposition: [(Zhang [2], Theorem 6).] 

The parameters [n,k,p] of a steganographic protocol de- 
fined over a field F q of cardinality q, verify that q k < V q (n, p). 

Proof, it suffices to prove the result for proper 
steganographic protocols. Take x G F™. For all s G F k 
there exists y G B(x,p) such that Ext(y) = s, hence 
card(B(x,p)) > card(F^) 



3) Proposition: Denote by r^n^p) the largest possible 
value of r = n — k for [n,k,p] steganographic protocol, 
similarly let r(n, p) denote the largest possible value of 
r = n — log(M) or M the number of messages that can 
hide using binary coverwords of length n. Let us recall the 
following result 

p p 

log(S2 C n) - l °g( n ) < r Un, p) < r(n, p) < log(V^ CI 



=o 



=0 



the rows of H. Finally it is clear that embedding efficiency = 3. ^3] 



4) Corollary: Let h(n,T) be the maximal number of bits 
embeddable by schemes using binary coverwords of length n 
with quality threshold T. We have 

r L (n,T)<h(n,T)<r{n,T) 

Proof. Hence, an steganographic protocol (with quality 
threshold T) is a slightly stronger condition than T-covering, 
because steganographic protocol generates not a single, but 
\X\ disjoint T-coverings. But in particular case of linear 
coverings these two notions coincide. 

V. Conclusion 

Application of error-correcting codes to data embedding 
improves embedding efficiency and security of steganographic 
schemes. In this paper, we show some relations between 
steganographic algorithms and error-correcting codes. By us- 
ing these relations we give some bound on the performance 
of embedding schemes 
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Abstract: Several efficient algorithms were 
developed to cope with the popular task of 
sorting. Kakkot sort is a new variant of 
Quick and Insertion sort. The Kakkot sort 
algorithm requires 0( n log n ) 
comparisons for worst case and average 
case. Typically, Kakkot Sort is significantly 
faster in practice than other O ( n log n ) 
algorithms , because its inner loop can be 
efficiently implemented on most 
architectures . This sorting method requires 
data movement, but less than that of 
insertion sort. This data movement can be 
reduced by implementing the algorithm 
using linked list. In this comparative study 
the mathematical results of Kakkot sort 
were verified experimentally on ten 
randomly generated unsorted numbers. To 
have some experimental data to sustain this 
comparison four different sorting methods 
were chosen and code was executed and 
execution time was noted to verify and 
analyze the performance. The Kakkot Sort 
algorithm performance was found better as 
compared to other sorting methods. 

Key words: Complexity, performance of 
algorithms, sorting 



Introduction 

Sorting is any process of arranging items in 
some sequence and/or in different sets, and 
accordingly, it has two common, yet distinct 
meanings: 



1. 



2. 



ordering: arranging items of the 
same kind, class, nature, etc. in some 
ordered sequence, 

categorizing: grouping and labeling 
items with similar properties together 
(by sorts). 



In computer science and mathematics, a 
Sorting Algorithm is an algorithm that puts 
elements of a list in a certain order. The 
most-used orders are numerical order and 
lexicographical order. Efficient sorting is 
important to optimizing the use of other 
algorithms (such as search and merge 
algorithms) that require sorted lists to work 
correctly. 

To analyze an algorithm is to determine the 
amount of resources (such as time and 
storage) necessary to execute it. Most 
algorithms are designed to work with inputs 
of arbitrary length. Usually the efficiency or 
complexity of an algorithm is stated as a 
function relating the input length to the 
number of steps (time complexity) or 
storage locations (space complexity). 
Algorithm analysis is an important part of a 
broader computational complexity theory, 
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which provides theoretical estimates for the 
resources needed by any algorithm which 
solves a given computational problem. 
These estimates provide an insight into 
reasonable directions of search for efficient 
algorithms. In theoretical analysis of 
algorithms it is common to estimate their 
complexity in the asymptotic sense, i.e., to 
estimate the complexity function for 
arbitrarily large input. Big O notation, 
omega notation and theta notation are used 
to this end 

Time complexity 

Time efficiency estimates depend on what 
we define to be a step. For the analysis to 
correspond usefully to the actual execution 
time, the time required to perform a step 
must be guaranteed to be bounded above by 
a constant. In mathematics, computer 
science, and related fields, Big Oh notation 
describes the limiting behavior of a function 
when the argument tends towards a 
particular value or infinity, usually in terms 
of simpler functions. Big O notation allows 
its users to simplify functions in order to 
concentrate on their growth rates: different 
functions with the same growth rate may be 
represented using the same O notation. 

Although developed as a part of pure 
mathematics, this notation is now frequently 
also used in computational complexity 
theory to describe an algorithm's usage of 
computational resources: the worst case or 
average case running time or memory usage 
of an algorithm is often expressed as a 
function of the length of its input using big 
O notation. 

Space complexity 

The better the time complexity of an 
algorithm is, the faster the algorithm will 
carry out his work in practice. Apart from 



time complexity, its space complexity is also 
important: This is essentially the number of 
memory cells which an algorithm needs. A 
good algorithm keeps this number as small 
as possible, too. The space complexity of a 
program (for a given input) is the number of 
elementary objects that this program needs 
to store during its execution. This number is 
computed with respect to the size n of the 
input data. 

There is often a time-space-tradeoff 
involved in a problem, that is, it cannot be 
solved with few computing time and low 
memory consumption. One then has to make 
a compromise and to exchange computing 
time for memory consumption or vice versa, 
depending on which algorithm one chooses 
and how one parameterizes it. 

In addition to varying complexity, sorting 
algorithms also fall into two basic categories 
— comparison based and non-comparison 
based. A comparison based algorithm orders 
a sorting array by weighing the value of one 
element against the value of other elements. 
Algorithms such as Quicksort, Mergesort, 
Heapsort, Bubble sort, and Insertion sort are 
comparison based. Alternatively, a non- 
comparison based algorithm sorts an array 
without consideration of pairwise data 
elements. Radix sort is a non-comparison 
based algorithm that treats the sorting 
elements as numbers represented in a base- 
M number system, and then works with 
individual digits of M. 

Another factor which influences the 
performance of sorting method is the 
behavior pattern of the input. In computer 
science, best, worst and average cases of a 
given algorithm express what the resource 
usage is at least, at most and on average, 
respectively. Usually the resource being 
considered is running time, but it could also 
be memory or other resources. 
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Kakkot Sort 

Kakkot Sort is a sorting algorithm that , 
makes O ( n log n ) (Big Oh notation) 
comparisons to sort n items. Typically, 
Kakkot Sort is significantly faster in 
practice than other O ( n log n ) algorithms , 
because its inner loop can be efficiently 
implemented on most architectures . This 
sorting method requires data movement but 
less than that of insertion sort. This data 
movement can be reduced by implementing 
the algorithm using linked list. Major 
advantage of this sorting method is its 
behavior pattern is same for all cases, ie 
time complexity of this method is same for 
best, average and worst case 
How it sorts 

From the given set of unsorted numbers, 
take the first two numbers and name it as 
key one and key two , ie, Kl and K2. Read 
all the remaining numbers one by one. 
Compare each number first with K2. If the 
number is greater than or equal to K2 then 
place the number right of K2 else compare 
the same number with Kl. If the number is 
greater than Kl then place the number 
immediate right of Kl else left of 
Kl.Conitnue the same process for all the 
remaining numbers in the list. Finally we 
will get three sub lists. One with numbers 
less than or equal to Kl, one with numbers 
greater than or equal to K2 and the other 
with numbers between Kl and K2. Repeat 
the same process for each sub list. Continue 
this process till the sub list contains zero 
elements or one element. 



Stepl. Read the first two numbers from N, 

Let Kl & K2 
Step2. Sort Kl and K2 
Step3. Read the next number, Let A 
Step4. Compare A with K2 

Step5. If A is greater than or equal to K2 

then place A right of K2 

else 

compare A with Kl. 

If A is less than Kl 

then place A left of Kl 

else 

Place A immediate right of Kl 
Step6 . If the list contains any more 

elements go to step 3 
Step 7. Now we have 3 Sub list. 

S First list with all values less 
than or equal to Kl. 

S Second with values between 
Kl and K2 

S Final with values greater than 
or equal to K2. 

Step8. If each list contains more than 1 
element go to stepl 

Step 9 End. 

Time complexity 

If there are 'n' numbers, then each iteration 
needs maximum 2 * (n-2) comparison and 
minimum of n-2 comparison and plus one. 
So if we take the average it will be 

=(2n-4+n-2)/2 + 1 
=(3n-6)/2+l 

= 3n/2 - 2 



Algorithm 

Kakkot Sort(N: Array of Numbers, Kl ,K2 , 

A:integers,) 



In the average case each list would have 3 

sub lists and number of iteration will be 

3 x =n 

taking logarithm on both side we get 
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x log 3= log n 

x= log n / log 3 

x= log n/ 0.4771 

Ignoring the constant we can write x = log n 

That is there will be log n iterations and 
each require 3n/2 - 2 comparisons. So the 
time complexity of Kakkot Sort in average 
case is 3n/2 - 2 * log n. When we represent 
in Big Oh notation constants can be ignored, 
so we get 0(n log n). 

If the list is already in sorted order, then two 
comparison will be required for each 
number ,so total no of comparison required 
for each iteration will be (n-2)+l, i.e. n-1 
and number of iteration will be n-l+n-3+n- 

6+ +1 

This can be written as 
1+3+5+ n-3+n-l. 

Sum of this series is 

S=N/2*(2a+(N-l)*d) 

Where N is the number of terms in the series 

'a' is first term 

'd' is the difference 

To get N th term, the equation is a+(N-l) d 

And here N th term is n- 1, so 

l+(N-l)*2=n-l 

2N=n 

N=n/2 

S=N/2(2*1+(N-1)*2) 

S=N/2(2+2N-2) 

S=N/2(2N) 

Sum= N 2 

Substitute value for N we get 

(n/2)**2 

This is equal to one forth of n 2 . So Kakkot 
Sort requires only one forth of Quick sort 
comparison in worst case. This is almost 
equal to average case time complexity. So 
we can say that time complexity of Kakkot 
sort is similar in all the cases. 

Now let me manually calculate the number 
of comparison that Kakkot sort take. 



Consider the following randomly generated 
ten unsorted numbers 

1,60,33,3,35,21,53,19,70,94 
List 1 

First two numbers are 1 and 60 

and sort it . Here Kl is 1 and K2 is 60 

Now the total comparison is one. 

Read the remaining numbers one by one 

Read 33, since 33 is less than K2 and greater 

than Kl it need two comparison . Now the 

total comparison is increased to 3. 

Read 3, total comparison is now 5 

Read 35, total comparison is now 7 

Read 21, total comparison is now 9 

Read 53, total comparison now is 1 1 

Read 19, total comparison now is 13 

Read 70, total comparison now is 14 

Read 94 total comparison now is 15 

Now the list will be 

1, 3,35,21,53,19,60,70,94 

Here we have 3 sublist 

The first one with zero elements 

Second list is , 3,35, 35,21,53,19 

Third list is 70,94 

Now do the same process second and third 

list 

Second list 

Read first two numbers, and sort 

We have Kl =3 and K2= 35 

Now total comparison is 16 

Read 21, total comparison now is 18 

Read 5 3, total comparison now is 19 

Read 19,total comparison now is 21 

Now the list will be 

3,19,21,35,53 

Now only one list with more than one 

element, ie 19 and 21 

Read the first two numbers and sort 

HereKl=19andK2=21 

Now the total comparison is 22 

Now regarding the sublist 3 we have two 

numbers 70 and 94 

Read the numbers and sort 

Now the total number of comparison is 23 
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So Using Kakkot sort , to sort the given ten 
randomly generated numbers require only 
23 comparisons. 

Kakkot Sort and Qucick Sort 

Time complexity of Quick sort is 0(n log n) 
in the case of average case and 0(n ) in the 
worst case behavior. From this it is clear that 
Kakkot sort is better than quick sort. While 
sorting Quick sort does not require any data 
movement where as Kakkot sort needs data 
movement when the item is less than first 
key element and greater than second key 
element. But this data movement can be 
avoided by implementing the algorithm 
using linked list. 

To sort the above ten numbers in the List 1 , 
Quick sort requires 29 comparisons 

Kakkot Sort and Heap Sort 

Heapsort is a much more efficient version of 
selection sort. It also works by determining 
the largest (or smallest) element of the list, 
placing that at the end (or beginning) of the 
list, then continuing with the rest of the list, 
but accomplishes this task efficiently by 
using a data structure called a heap, a special 
type of binary tree. Once the data list has 
been made into a heap, the root node is 
guaranteed to be the largest(or smallest) 
element. When it is removed and placed at 
the end of the list, the heap is rearranged so 
the largest element remaining moves to the 
root. Using the heap, finding the next largest 
element takes 0(log n) time, instead of O(n) 
for a linear scan as in simple selection sort. 
This allows Heapsort to run in 0(n log n) 
time, and this is also the worst case 
complexity. 

With the same set of unsorted numbers in 
the List 1, Heap sort requires 30 
comparisons 



Kakkot Sort and Bubble Sort 

Bubble sort is a straightforward and 
simplistic method of sorting data that is used 
in computer science education. The 
algorithm starts at the beginning of the data 
set. It compares the first two elements, and if 
the first is greater than the second, then it 
swaps them. It continues doing this for each 
pair of adjacent elements to the end of the 
data set. It then starts again with the first two 
elements, repeating until no swaps have 
occurred on the last pass. This algorithm is 
highly inefficient, and is rarely used[citation 
needed] [dubious - discuss], except as a 
simplistic example. For example, if we have 
100 elements then the total number of 
comparisons will be 10000. A slightly better 
variant, cocktail sort, works by inverting the 
ordering criteria and the pass direction on 
alternating passes. The modified Bubble sort 
will stop 1 shorter each time through the 
loop, so the total number of comparisons for 
100 elements will be 4950. 

Bubble sort average case and worst case are 
both 0(n 2 ) 

For the above unsorted numbers in the List 1 
Bubble sort requires 45 comparisons. 



Kakkot Sort and Insertion Sort 

Insertion sort is a simple sorting algorithm 
that is relatively efficient for small lists and 
mostly-sorted lists, and often is used as part 
of more sophisticated algorithms. It works 
by taking elements from the list one by one 
and inserting them in their correct position 
into a new sorted list. In arrays, the new list 
and the remaining elements can share the 
array's space, but insertion is expensive, 
requiring shifting all following elements 
over by one. Shell sort (see below) is a 
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variant of insertion sort that is more efficient 
for larger lists. 

Insertion sort requires 38 comparisons to 
sort the above ten randomly generated 
numbers in the List 1 . 



Conclusion 

From the above examples it is clear that 
Kakkot Sort time complexity is better than 
other sorting methods. Even though Kakkot 
sort requires data movement of items when 
the item is less than the key K2 and greater 
than the key Kl, this data movement can be 
reduced by implementing the algorithm 
using linked list. 
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Abstract — In this work we propose a novel Steganographic 
method for hiding information within the spatial domain of the 
gray scale image. The proposed approach works by dividing the 
cover into blocks of equal sizes and then embeds the message in 
the edge of the block depending on the number of ones in left 
four bits of the pixel. The purpose of this work is to generalize 
the PVD method [7] With four-pixel differencing instead of two- 
pixel differencing and use the LSB Substitution to hide the secret 
message in the cover image 

Keywords: Steganography, Watermarking, Least Signifi- 
cant Bit(LSB), PVD method, Digital Images, Information 
Hiding, Pixel- value differencing. 

I. Introduction 

Steganography is the art of stealth communication. Its pur- 
pose is to make communication undetectable. The steganogra- 
phy problem is also known as the prisoners' dilemma formu- 
lated by Simmons [4]. Alice and Bob are imprisoned and want 
to hatch an escape plan. They are allowed to communicate via 
a channel monitored by a warden. If the warden finds out that 
they are communicating secretly, he throws them into solitary 
confinement. Thus, the prisoners need to design a method to 
exchange messages without raising the warden's suspicion. 
The prisoners hide their messages in innocuous-looking cover 
objects by slightly modifying them (obtaining stego objects). 
The embedding process is usually driven by a stego key, which 
is a secret shared between Alice and Bob. It is typically used 
to select a subset of the cover object and the order in which the 
cover object elements are visited during embedding. The most 
important property of any steganographic communication is 
statistical undetectability. In other words, the warden should 
not be able to distinguish between cover and stego objects. 
Formal description of this requirement in information-theoretic 
terms was given by Cachin [5]. If the communication channel 
that Alice and Bob use is distortion-free, we speak about the 
passive warden scenario. 

The most common and well-known steganographic method is 
called least significant bit (LSB) substitution, which embeds 
secret data by replacing k LSBs of a pixel with k secret bits di- 
rectly [1]. Many optimized LSB methods have been proposed 
to improve this work [2], [3]. The human perceptibility has 
a property that it is sensitive to some changes in the pixels 
of the smooth areas, while it is not sensitive to changes in 
the edge areas. Not all pixels in a cover image can tolerate 



equal amount of changes without causing noticeable distortion. 
Hence, to improve the quality of stego images, several adaptive 
methods have been proposed in which the amount of bits to 
be embedded in each pixel is variable. Wu and Tsai proposed 
a novel steganographic method that uses the difference value 
between two neighboring pixels to determine how many secret 
bits should be embedded [7]. 

In contrary : Steganalysis methods attempt to detect Stego- 
image and extract it. Inserting secret bits in image changes 
some statistics of image, this opens some roads to detect Stego- 
image. So the changes made by Steganographic are a key 
performance metric ; lower change : more robust algorithm. 
It is evident that the changes in cover image are related to the 
volume of inserted bit, so Stego-images with higher insertion 
rate are detected more easily. 

Stegananalysis methods generally are divided in two main 
groups: active and passive methods. In passive methods only 
presence or absence of hidden data is considered, while in 
active methods a inserted data is extracted [8]. Furthermore, 
different steganalysis methods, depending on steganography 
algorithms they target, can be classified in two groups : Model- 
based (Specific) and Universal Steganalysis. 
The aim of this work is to generalize the PVD method [7] With 
four-pixel differencing instead of two-pixel differencing and 
LSB Substitution. The remainder of the paper is organized as 
follows. Section 2 gives a brief introduction to Steganography 
and Data Hiding Methods. We construct our approach and 
report on experimental results in section 3 and 4. Section 5 
gives a conclusion. 

II. Digital Images in Steganography 

A. Digital Images 

A digital image at the most abstract level is a two- 
dimensional array of colored pixels or dots. When these pixels 
are displayed on a high-resolution monitor and viewed at 
an appropriate distance, they appear to be a continuously 
colored image. Each pixel is a certain color which is typically 
defined, using the redgreen- blue (RGB) color model, as 
a combination of varying amounts of red, green, and blue 
light. A color image is therefore said to contain three bands, 
each of which represents the amount of red, green, or blue 
light in the image. Whereas a color image contains color 
and intensity information, a gray- scale image is composed of 
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pixels that vary only in intensity, not color. Gray- scale images 
therefore have only a single band. Without loss of generality, 
the remaining discussion will focus on gray-scale images. The 
discussion is easily extended to cover color images by noting 
that a color image is the composition of three individual gray- 
scale images representing the red, green and blue bands. The 
typical gray-scale image has an 8-bit depth which is sufficient 
to represent 256 unique intensity values ranging from black 
to white [9]. A brief review of binary representation will be 
instructive when interpreting bit-level pixel data in the context 
of a digital image. An 8 -bit binary numeral has the general 
form 



A 7 2 7 + A 6 2 6 



A 1 2 1 +A 2° 



where A n represents a single binary digit. In a digital image 
it is clear that A 7 is the most significant bit and indicates 
whether the pixel value is greater than 127. A common means 
of converting a grayscale image to a binary (i.e. black-and- 
white) image is to extract the A 7 bit from each pixel. By 
contrast, Aq embodies relatively little information and, in the 
context of a digital image, can generally be understood as a 
noise channel. 

B. Overview of Steganograhy 

Steganography hides secret messages under the cover of 
a carrier signal so it cannot be seen or detected [6], [8], 
[11]. Steganography technique should generally possess two 
important properties: good visual/statistical imperceptibility 
and a sufficient payload. The first is essential for the security 
of hidden communication and the second ensures that a large 
quantity of data can be conveyed [10]. Two levels of protection 
can be done if the message is encrypted before hiding it, so 
it must be decrypted before reading it. Invisible watermarking 
is treated as a subset of steganography [10]. 




Fig. 1 . Embedding of a secret message into the three least significant channels 
of a cover image 



The difference is that steganography conceals a message 
so that this hidden message is the object of the communi- 
cation where in watermarking; the hidden message provides 
important information about the cover media, such as au- 
thentication or copyright. Steganography, in the simplest case, 
capitalizes on this overabundance of information by replacing 
the noise channels (i.e. the least significant bit channels) 
with an arbitrary secret message. Figure 1 gives an overview 
of a steganographic process flow. A source image, hereafter 
referred to as a cover, is viewed as 8 information carrying 
channels. A secret message is spread over the least significant 
channels (in this case the three least significant channels) 
with the modified channels re-combined to obtain an output, 
hereafter referred to as the stego image, that visually resembles 
the cover image and contains the injected message. 

III. PVD Method for Gray-Level Image 

The pixel-value differencing (PVD) method [7] segments 
the cover image into nonoverlapping blocks containing two 
connecting pixels and modifies the pixel difference in each 
block (pair) for data embedding. A larger difference in the 
original pixel values allows a greater modification. The hiding 
algorithm is described as follows: 

1) Calculate the difference value di for each block of two 
consecutive pixels Pi and Pi+i, di = P*+i — Pi 

2) Find the optimal Ri of the di such that Ri = min(ui — 
k), where ui > k, k = \di\ and Ri G [k,Ui] 

3) Decide t bits of secret data which are hidden with each 
di, i.e. each block of two consecutive pixels is defined 
as t = log2(wi) where wi is the width of the Ri 

4) Read t bits binary secret data one by one according to 
Step 3, and then transform t into decimal value b. For 
instance, assume a binary secret data is 101, then 6 = 5. 

5) Calculate the new difference value d\ using: d' i = li + b, 
for di > or d[ = -(k + 6), for d* < 

6) Pi and P^ + i are modified to hide t secret data by the fol- 
lowing formula: (P!,P! +1 ) = (Pi - \f],P i+ i + Lf J) 
: d % G odd or (P/,P/ +1 ) = (Pi - [f\,Pi+i + \f 1) : 
di G even where m = d[ — di. Finally, we compute the 
values of (P/,P/ +1 ) which represent the secret data. 

7) Repeat Steps 1-6, until all secret data are hidden into 
the cover image and the stego-image is obtained. 

In the extraction phase, the original range table is necessary. 
It is used to partition the stego-image by the same method as 
used to the cover image. The extraction phase is implemented 
as follows: 

1) Calculate the difference value d[ between each two 
successive pixels for each block (P/,P/ +1 ) from the 
following formula : d\ = \P( +1 - P/| 

2) Find the optimum Ri of the d[ just as in Step 2 in the 
hiding phase. 

3) Obtain b' by subtracting U from d[. The b' value repre- 
sents the value of the secret data in decimal. 

4) Convert b' into binary then find number of bits t from 
the secret data, where t = log2(wi) [7] 
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IV. Proposed Steganography Scheme 

In this section we discuss the proposed approach for hiding 
information within the spatial domain of the gray scale image. 
The proposed approach works by dividing the cover into 
blocks of equal sizes (8x8). Our proposed method adaptively 
embeds messages using two levels (lower-level and higher- 
level), and the square of median value M is used to partition 
the average difference D into two levels. If D < M, D 
belongs to "lower-level" (i.e., the block belongs to a smooth 
area). Otherwise, D belongs to "higher-level" (i.e., the block 
belongs to an edge area). 

A. Determine The Place of Embedding in The Image 

All the pixels in the cover image are 256 gray values. 
The cover image is partitioned into non-overlapping four-pixel 
blocks. For each block, there are four neighboring pixels pij, 
Pij+i, Pi+i,j, Pi+i,j+i, and their corresponding gray values 
are yi, y 2 , V3 and y±, respectively. 

1) Divide the cover into blocks of equal sizes 8x8 

2) Calculate the square root of median for each block. M = 
^(median) 

3) Calculate the average difference value D, which is given 

by D = |ELo(^+i ~Vi) 

4) IF D > M, then embed Message in pij, Pij+i, Pi+ij, 
Pi+ij+i, (go to The embedding algorithm) 

B. The embedding algorithm 

1) Split each pixel into two equal parts (see Figure 2). 

2) Count number of 1 in the most part and embed a secret 
message in the least part according to the corresponding 
number of bits in Table 1 . 







Pi 




r 


J 




^ 


| 8 ! 7 


1 s 1 


5 


4 ! ! : ! : ! 


Most Part 








Least Fart 



Fig. 2. Split Process. 



number of 1 in 
the most part 



4 or 3 

2 
1 or 



number of Bits 
to be embedded 



3 bits 

2 bits 

1 



The recipient uses the extraction algorithm in order to 
extract the secret message from the stego-image. Extracting 
secret message is done in the same way as in the embedded 
operation, depending on the value of the median: M = 
\J (median). If the average difference value D is more than 
the value of M then extract the message depending on the 
rule in Table 1. 



V. Experimental results 

Several experiments are preformed to evaluate our proposed 
method. Ten gray-scale images with size 512x512 are used in 
the experiments as cover images, and three of them are shown 
in Fig. 3. A series of pseudo-random numbers as the secret bit 
streams are embedded into the cover images. The peak signal 
to noise ratio (PSNR) is utilized to evaluate the quality of the 
stego image. For anMxiV gray-scale image, the PSNR value 
is defined as follows: 

_ 7 255 x 255 xMxN 



PSNR 



i) 



where P^ and Qij denote the pixel values in row i and column 
j of the cover image and the stego image, respectively. In this 
section we present the experimental results of stego-image on 
three will known images: Lena, Pepper, and Baboon images. 
These images are shown in Figs 3. The quality of stego-image 
created by our proposed method are shown in Figs. 4. As 
the figures show, distortions resulted from embedding are 
imperceptible to human vision. We present also a comparative 
study of the proposed methods with PVD method. 

We have analyzed our results according to PVD method for 
each of the tested images. We also analyzed our results by 
computing Pay load, and peak signal-to noise ratio (PSNR). 




Fig. 3. Three cover images with size 512 x 512: (a) Lena (b) Peppers (c) 
Baboon. 

Pay load: the size of date that could be imbedded within the 
cover-image is shown in Table 2 



Image 


Image size 


Data size 


Data size 






(PVD) 


(Proposed Method) 


Lena 


128 x 128 


2048 


2493 




255 x 255 


8192 


10007 




512 x 512 


32768 


40017 




1024 x 1024 


131072 


160604 


Peppers 


128 x 128 


2048 


2560 




255 x 255 


8192 


10211 




512 x 512 


32768 


40990 




1024 x 1024 


131072 


163724 


Baboon 


128 x 128 


2048 


2443 




255 x 255 


8192 


9767 




512 x 512 


32768 


39034 




1024 x 1024 


131072 


156308 



Figure. 4 shows the amount of messages hidden in the 3 
cover images. Three stego images (a) Lena (embedded 40017 
bits, PSNR = 42.68d£) (b) Peppers (embedded 40990 
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bits, PSNR = 43.23dB) (c) Baboon (embedded 39034 bits, 
PSNR = 37.71dB). 




(a) (b) 

Fig. 4. Three stego images : (a) Lena (b) Peppers (c) Baboon. 



VI. Conclusion 

In this paper, we have proposed a novel steganographic 
method based on four-pixel differencing and LSB substitution. 
Secret data are hidden into each pixel by the k-bit LSB 
substitution method, where k is decided by the number of 
1 in the most part for pixel. Experimental results showed that 
the proposed method gave best values for the PSNR measure, 
which means that there is no difference between the original, 
and the stegano-images. 
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Abstract — Education, banking, various business and humans' 
necessary needs are made available on the Internet. Day by day 
number of users and service providers of these facilities are 
exponentially growing up. The people face the challenges of how 
to reach their target among the enormous Information on web on 
the other side the owners of web site striving to retain their 
visitors among their competitors. Personalized attention on a 
user is one of the best solutions to meet the challenges. 
Thousands of papers have been published about 

personalization. Most of the papers are distinct either in 
gathering users' logs, or preprocessing the web logs or Mining 
algorithm. In this paper simple codification is performed to 
filter the valid web logs. The codified logs are preprocessed with 
polynomial vector preprocessing and then trained with Back 
Propagation Algorithms. The computational efforts are 
calculated with various set of usage logs. The results are proved 
the goodness of the algorithm than the conventional methods. 

Keywords- web usage mining; Back propagation algorithm;, 
Polynomial vector processing 

I. Introduction 

Web users feel comfortable if they reached the desired web 
page within the minimum navigation on a web site. A study of 
Users' recent behavior on the web will be useful to predict 
their desired target page. Generally Users' browsing patterns 
are stored in the web logs of a web server. These patterns are 
learned through the efficient algorithms to find the target page. 
Backpropagation Algorithm with Polynomial Vector 
Preprocessing,(BPAPVP) is implemented for learning the 
patterns. With learned knowledge, various set of users' 
browsing patterns are tested. The results are observed and 
presented as an analysis on computational efforts of the 
algorithm. The analysis on the results proves the correctness of 
the algorithm. Thus the BPAPVP leads to improved web usage 
mining than the numerous conventional methods. 

A. Literature Review 

Michael Chau et al. [1] attempted to use Hopfield Net 
for web analysis. The web structure and content analysis 
are incorporate into the network through a new design of 
network Their algorithm performed (70% of accuracy) 
better than traditional web search algorithms such as 



breadth-first search(42.6% of accuracy) and best- first 
search algorithms(48.2% of accuracy). David Martens et 
al. [2] proposed a new active learning based approach 
(ALBA) to extract comprehensible rules from opaque 
SVM models. They applied ALBA on several publicly 
available data sets and confirmed its predictive accuracy. 
Dilhan Perera [3] et al. have performed mining of data 
collected from students working in teams and using an 
online collaboration tool in a one-semester software 
development project. Clustering was applied to find both 
groups of similar teams and similar individual members, 
and sequential pattern mining was used to extract 
sequences of frequent events. The results revealed 
interesting patterns characterizing the work of stronger 
and weaker students. Key results point to the value of 
analysis based on each resource and on individuals, rather 
than just the group level. They also found that some key 
measures can be mined from early data, in time for these 
to be used by facilitators as well as individuals in the 
groups. Some of the patterns are specific for their context 
(i.e., the course requirements and tool used). Others are 
more generic and consistent with psychological theories 
of group work, e.g., the importance of group interaction 
and leadership for success. Edmond H.Wu et al.[4] 
introduced an integrated data warehousing and data 
mining framework for website management. The model 
focuses on the page, user and time attributes to form a 
multidimensional can be which can be frequently updated 
and queried. The experiment shown that data model is 
effective and flexible for different analysis tasks. Gaung- 
bin Huang et al. [5] proposed a simple learning algorithm 
capable of real-time learning, which can automatically 
determine the parameters of the network at one time only. 
This learning algorithm is compared with BP and k-NN 
algorithm. There are 4601 instances and each instance 
has 57 attributes. In the simulation 3000 randomly 
selected instances compose the training set and all the rest 
are used for testing. RLA achieves good testing accuracy 
at very fast learning speed; however BP need to spend 
4641.9s on learning which is not realistic in such a 
practical real-time application. In the forest typed 
prediction problem 100,000 training data and 481012 
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testing data have been taken. The testing time of k-NN 
can be as long as 26 hours, where as RLA finished within 
65.648 seconds. Incorporating neural network (NN) into 
supervised learning classifier system (UCS) [6] offers a 
good compromise between compactness, expressiveness, 
and accuracy. A simple artificial NN is used as the 
classifier's action and obtained a more compact 
population size, better generalization and the same or 
better accuracy while maintaining a reasonable level of 
expressiveness negative correlation learning (NCL) is also 
applied during the training of the resultant NN ensemble. 
NCL is shown to improve the generalization of the 
ensemble. Hongjun Lu et al.[7] proposed an neural 
network to extract concise symbolic rules with high 
accuracy. They have been improving the speed of 
network training by developing fast algorithms, the time 
required to extract rules by our neural network approach 
is still longer than the time needed by the decision tree 
approach. They tried to reduce the training time and 
improve the classification accuracy is to reduce the 
number of input units by feature selection. James 
caverlee et al.[8] presented the Thor framework for 
sampling, locating and partitioning the QA-Pagelets ( 
Query- Answer pagelets) from the Deep web. [ Large and 
growing collection of web accessible databases known as 
the deep web ] Their experiments have shown that the 
proposed page clustering algorithm achieves low-entropy 
clusters and the sub-tree clustering algorithms identify 
QA-Pagelets with excellent precision and recall. Lotfi 
Ben Romdhane [9] extends a neural model for casual 
reasoning to mechanize the monotonic class. They 
developed Unified Neural Explainer (UNEX) for casual 
reasoning (independent, incompatibility and open). 
UNEX is mechanized by the use of Fuzzy AND-ing 
networks, whose activation is based on new principle, 
called softmin. They considered a battery of 1000 
random manifestations/cases. UNEX had a coverage 
ration greater than 0.95 in 220 cases (22%). Magdalini 
Eirinaki et al. [10] presented a survey of the use of web 
mining for web personalization. A review of the most 
common methods that are used as well as technical issues 
that occur is given, along with a brief overview of the 
most popular tools and applications available from S/W 
vendors. Mankuan Vai et al.[ll] developed a systematic 
approach that creates a Hop field network to represent 
qualitative knowledge about a system for analysis and 
reasoning. A simple sic node neural network is designed 
as a building block to capture basic qualitative relations. 
The objective of the transistor modelling technique is to 
determine the topology of an equivalent circuit and to 
extract its element values from the measured device data. 
The ultimate advantage of the neural network is in its 
capability of implementing the neural network as a 
parallel distributed processor, which will remove the time 
consuming factor of sequentially updating individual 
neurons. C. Porcel et al. [12] presented a new fuzzy 



linguistic recommender system that facilitates the 
acquisition of the user preferences to characterize the user 
profiles. They allowed users to provide their preferences 
by means of incomplete fuzzy linguistic preference 
relation. The user profile is completed with user 
preferences on the collaboration possibilities with other 
users. Therefore, this recommender system acts as a 
decision support system that makes decisions about both 
the resources that could be interesting for a researcher and 
his/her collaboration possibilities with other researchers to 
form interesting working groups. The experimental results 
shown the user satisfaction with the received 
recommendations. The average of precision, recall and Fl 
(Fl is a combination metric that gives equal weight to 
both precision) metrics are 67.50%, 61.39% and 63.51%, 
respectively. Ranieri Barglia et al.[13] proposed a 
recommender system that helps user to navigate through 
the web by providing dynamically generated links to 
pages that have not been visited and are of potential 
interest. They contributed and suggest, a privacy 
enhanced recommender system that allows for creating 
serendipity recommendations without breaching users 
privacy. They said that a system is privacy safe if the two 
conditions hold: (i) The user activity cannot be tracked (ii) 
The user activity cannot be inferred. They conducted a set 
of experiments assess the quality of recommendations 
Sankar K.Pal et al. [14] summarized the different type of 
web mining and its basic components, along with their 
current states of are. The limitations of existing web 
mining methods / tools are explained. The relevance of 
soft computing is illustrated through example and 
diagrams. Tianyi et al. [15] is examined the problem of 
optimal partitioning of customer bases into homogeneous 
segments for building better customer profiles and have 
presented the direct grouping approach as a solution. That 
approach partitions the customers not based on computed 
statistics and particular clustering algorithms, but in terms 
of directly combining transactional data of several 
customers and building a single model of customer 
behaviour on that combined data. They formulated the 
optimal partitioning problem as a combinatorial 
optimization problem and showed that it is NP-hard. 
Then, three suboptimal polynomial-time direct grouping 
methods, Iterative Merge (IM), Iterative Growth (IG), and 
Iterative Reduction (IR) are shown that the IM method 
provides the best performance among them. It is shown 
that the best direct grouping method significantly 
dominates the statistics-based and one-to-one approaches 
across most of the experimental conditions, while still 
being computationally tractable. It is also shown that the 
distribution of the sizes of customer segments generated 
by the best direct grouping method follows a power law 
distribution and that micro segmentation provides the best 
approach to personalization. Vir V.Phoha et al.[16] 
developed a new learning algorithm for fast web page 
allocation on a server using the self-organizing properties 
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of the neural network (NN). They compared the 
performance of the algorithm with round-robin (RR). As 
the number of input objects increases, the algorithm 
achieves a hit ratio close to 0.98 whereas RR schema 
never achieve more than 0.4. Xiaozhe Wang et al.[17] 
proposed a concurrent neuro-fuzzy model to discover and 
analyze useful knowledge from the available web log 
data. They made use of the cluster information generated 
by self organizing map for pattern analysis and a fuzzy 
inference systerm to capture the chaotic trend to provide 
short-term(hourly) and long-term (daily) web traffic 
trend predictions. Yu-Hui et al.[18] explored a new data 
source called intentional browsing data (IBD)for 
potentially improving the effectiveness of WUM 
applications IBD is a category of online browsing actions 
such as "copy", "scroll", or "save as " and is not recorded 
in web log files. Consequently this research aims to build 
a basic understanding of IBD, which will lead to its easy 
adoption in WUM research and practice. Specially, this 
paper formally defines IBD and clarifies its relationship 
with other browsing data. Zhicheng Douet al. [19] 
developed an evaluation framework based on real query 
logs to enable large-scale evaluation of personalized 
search. They have taken 5 algorithms for evaluation 
research (i) Click-based algorithm (P-Click) , (ii) long- 
term user topic interests ( L-Topic) (iii) Short-term 
interests (S-Topic) (iv) Hybrid of L-Topic and S-Topic, 
(LS-Topic).(v) Group base personalization (G-Click). 
They found that no personalization algorithms can out- 
perform others for all queries and concluded that different 
methods have different strength and weakness. Zi Lu et 
al. [20] reviewed related research results in this area and 
their practical significance for a comprehensive 
explanation of various effect functions based on utility 
theory. They used the data on Internet development in 
China and related intelligent decision models to calculate 
the effect function. Based on the findings, they explained 
the features of the effect of website information flow on 
realistic human flow from various aspects. Research 
results showed that the effect of website information flow 
can be divided into substitution and enhancement, so that 
the relationship of the website information flow in 
guiding the human flow changes from one dimension to 
multi-dimensional morphology. They indicated that, on 
one hand, website information flow is lagged to some 
extent, but is enhanced gradually and grows faster than 
realistic human flow; on the other hand, by comparing the 
evolution trend of the intensity of the two functions, it can 
be seen that the enhancement function occurs later than 
the substitution, but develops faster and has greater force. 
Following comparison between the simulation value and 
the actual value, it is proved that the effect of website 
information flow is basically in line with the relationship 
of realistic human flow. These results can support 
government and business in making decisions on web 
information publication. Through the comparison between 



enhancement effect and substitution effect, they found 
that the substitution and enhancement effect of website 
information flow to realistic human flow exist 
simultaneously. The development trend of the 
enhancement effect is quicker than that of the substitution 
effect, and the enhancement effect is stronger. The 
information flow guiding human flow in the initial period 
of the network economy suggests that the substitution 
effect is stronger, and in the later period that the 
enhancement effect is stronger and quicker. 



II. Problem Definition 

Users' browsing patterns are gathered from the web server and 
then extracts only the valid logs i,e., The logs that doesn't 
contain robots.txt, jpg, ,gif etc and unsuccessful request. 
These logs are codified with Meta data of the web site. Then 
the codified patterns are applied to the polynomial vector for 
preprocessing . The preprocessed data are fed to back 
propagation algorithm for training the usage patterns. 

Machine learning theory based web usage mining assumes 
no statistical information about the web logs. This work falls 
under the category of supervised learning by employing two 
phase strategies such as a) Training phase b) Testing phase. In 
training phase, original logs are codified by simple 
substitution of unique pageid instead of page name for all the 
successful html requests and are interpolate by preprocessing 
into polynomial vector. The n dimensional patterns are inner- 
product to obtain 2 dimensional vectors which is trained by 
neural classifier to learn the nature of the logs. BPA takes the 
role of neural classifier in this work. By training the classifier 
for a specific users' logs a reasonably accurate suggestions can 
be derive. In testing phase, various users' logs are supplied to 
the trained classifier to decide which page-id is to be 
suggested. The flow charts of both phases are given in 
Figurel.a and Figure l.b 
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III. Implementation 

The simulation of personalization through web usage 
mining has been implemented using MATLAB 7®. 
Sample sets of logs are taken from ProtechSC's web 
server. These logs are filtered and codified. Table II 
gives sample codified logs that have been obtained after 
codification of the extended log format. Each number 
refers to a webpage. The % symbol is the comment and 
the number after the percent is the line number. Users' 50 
days patterns have been collected. 25 patterns have been 
used for training and the remaining patterns used for 
testing. 

A. Filter the Log File 

the web logs are collected from the web server of 
www.protechsc.net . Sample web log file of this site is given 
in Fig.2 

TABLE I- CODIFICATION TABLE OF WWW.PROTECHSC.NET 



PageName 


Code 


index.html 


1 


aboutus.html 


2 


Dissertation.html 


3 


Whatwedo.html 


4 


Projecttopics.html 


5 


Services.html 


6 


consultation.html 


7 


Contactus.html 


8 


PaymentDetails.html 


9 


Enquies&Comment.html 


10 


Algorithm 


11 


Flowchart 


12 


Submit 


13 


SpeechSeparation.html 


14 


WaveletPackett.html 


15 


PwdAuthentication.html 


16 


OFDM Frequency.html 


17 


CharRecog.html 


18 


CarotidArtery.html 


19 


AnalysisMRI.html 


20 


BPA Char.html 


21 


DirectSearch.html 


22 


Detect micro classfication.html 


23 


Cloud Contamination.html 


24 


Inforetrieval 


25 



Fingerprint ANN. html 


26 


FacialRecog.html 


27 


Obj ectRecog.html 


28 


HarmonicAnalysis.html 


29 


ImageCompression.html 


30 


ImageDeconvolution.html 


31 


Intrusion.html 


32 


ImageCompression.html 


33 


ImageRestoration.html 


34 


Obj ectTracing.html 


35 


DigitalModulation.html 


36 


EDM Matching.html 


37 


CuttingTool.html 


38 


ToolWear.html 


39 


PowerForecasting.html 


40 


RemoteSpeaker.html 


41 


SpeakerIdentification.html 


42 


SegmentationTextures.html 


43 


Steganalysis.html 


44 


Steganagraphy.html 


45 


SoftSecurity.html 


46 


SurvillanceRobot.html 


47 


TransmitterPlacement.html 


48 


TextureSegmentation.html 


49 


ImageRecovery.html 


50 


WoodDefect.html 


51 


3DFacial.html 


52 



66.249.71.171 -- [25/Miy2009: 18:50:11 +0530] "GET. robots bttHTIP/11" 404- "-" 

"Mizilla- 5 .0 (compatible ; Googlebob' 2. 1 ; +http:.v -www. google.com "boLhtml)" 

66.249.71.171 -- [25.=Miy2009: 13:50:12 +0530] "GET /consul tation.html HTTP/1. 1" 304- 
"-" "MjziHa-5.0 (compatible; Googlebot'2.1; +hHp:. ! 7^T\T\ : .google.com. : 'bothtml) 11 

66.249.71.171 -- L2.5.^fey2009: 19:05:19 +0530] "GET/dissertation.lilml HTIP/1.1" 304- 

"-" "Mjzilla'5.0 (compatible; Googlebob'2.1; 4hHpt .'/www.e oogls.com boLhtml)" 

66.249.71.171 -- [25.=Miy2009: 21:43:07 +0530] "<^T /Contact j^phpHTTO 1.1" 200 
41264 "-" "Mjzilla^.O (compatible; Googlebot'21;+http://T\^Ti-.google.com, : 'bothtml)' 1 

72.20.109.34-- [25.=Mi>72009: 23:01:41 +0530] "GET robots bit HTTP 1.1" 404- "-" 

"Mizilla' 5 .0 (compatible ; GurujiEot/1 .0; +http//www. guruj i.com hl A\ 5bmastsrFAQ.html)" 

72.20.109.34-- [25.=Miy2009: 23:01:41 +0530] "GET /index, html HTTO1.1" 200 23061 "-" 
"Mizilla' 5 .0 (compatible ; GurujiEot/ 1 .0; +http//-www. guruj i.com en W2bmasterFAQ.html) 1 ' 

72.30.79.32-- [27,'Miy2009: 22: 11:42 +0530] "GET/*iiat_w5_do.himl HTIP/1.0" 200 

20954 "-" "Mizilla'D.O (compatible; Yahoo! Slurp"3.0; 
http://help.3ahoo.com'helpus. : '>5earch : "slurp)" 

Figure 2: Sample Web logs ofwww.protechsc.net 

The Filtering Process as follows: 

Step l:Select the logs which don't contain Robots.txt and 
request of image files. 

Step2: Group by IP address of the logs 

Step3: Codify the requested page with following information 

Step 4: Store only IP address, visited page-id into database 
and make use of it for the polynomial preprocessing. 

These steps are pictorially presented in figure 3. 
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WEB SERVER 



Web logs 



Codification table 



Filtered logs 



Select web logs 
with html requests 



Group by 
Ip Address 



Codify the request 



nf is the number of features (nf = 11). 
An outer product matrix X op of the original input vector is 
formed, and it is given by: 



Xop.. 



X1X1 


X1X2 


X1X3 


X1X4 


X1X5.. 


...X1X21 


X2X1 


X2X2 


X2X3 


X2X4 


X2X5.. 


...X2X21 


X3X1 


X3X2 


X3X3 


X3X4 


X3X5.. 


...X3X21 


X4X1 


X4X2 


X4X3 


X4X4 


X4X5.. 


...X4X21 


X5X1 


X5X2 


X5X3 


X5X4 


X5X5.. 


...X5X21 



X21X1 X21X2 X21X3.. X21X4 X21X5 X21X21 



(2) 



Figure 3 . Filtering the Logs 



TABLE II - CODIFIED WEBPAGE DETAILS OF A USER 


a=[l, 2, 3, 4, 5,6, 7, 8,13, 0, 0, 0; %1 


1, 2, 4, 5,14, 9,10,11,13, 8, 3, 0; %2 


1, 2, 4, 5,14,10,11, 5,15,10,13, 4; %3 


5,15, 9,10,11,12,13, 6, 8, 7, 0, 0; %4 


5, 7, 8, 3, 4, 6, 0, 0, 0, 0, 0, 0; %5 


5,16, 9,10,11,12, 3, 7, 8, 0, 0, 0; %6 


5,17,10,11, 3, 6, 8, 1, 0, 0, 0, 0; %7 


5,26, 9,10,11,12, 5,18, 9,10,11,12; %8 


2, 3, 5,27,10,11,12, 6, 8, 0, 0, 0; %9 


2, 4, 7, 5,19,10,11,12,13, 0, 0, 


%10 


2, 6, 5, 3, 4, 0, 0, 0, 0, 0, 0, 


%11 


3, 4, 5, 6, 7, 8, 5,32, 9,10,11, 


%12 


3, 5, 8, 0, 0, 0, 0, 0, 0, 0, 0, 


%13 


3, 7, 5,45, 9,10,11,12, 0, 0, 0, 


%14 


1, 4, 5, 7, 8, 3, 6, 5,38, 9,10,11 


%15 


4, 6, 3, 5,41, 9,10,11,12,13, 8, 1 


%16 


4, 5,18, 9,10,11,12, 2, 0, 0, 0, 


%17 


4, 6, 5, 8, 3, 5, 0, 0, 0, 0, 0, 


%18 


6, 3, 5, 7, 8, 1, 0, 0, 0, 0, 0, 


%19 


6, 7, 4, 3, 5,22, 5,34, 5,17, 9 , 8 


%20 


1, 4, 5,22, 9,10,11, 3, 0, 0, 0, 


%21 


2, 4, 8, 5,29, 5,32, 5,40, 9,10,11 


%22 


3, 6, 7, 5,14, 5,16, 5, 4, 0, 0, 


%23 


7, 8, 3, 4, 5, 0, 0, 0, 0, 0, 0, 


%24 


1, 3, 4, 5,14, 9,10,11,12, 2,13, 0] %25 



B. Polynomial Interpolation 

Polynomial interpolation is the interpolation of a given 
navigation patterns by a polynomial set obtained by outer 
product the given navigation sequence. Polynomial 
interpolation forms the basis for comparing information 
between two points. The pre-processing generates a 
polynomial decision boundary. The pre-processing of the input 
vector is done as follows: 
Let X represents the normalized input vector, 

X = ^X 1 hi=l,..-nf, (1) 

Where X, is the feature of the input vector 



Using the X op matrix, the following polynomials are 

generated: 
(i) Product of inputs (NL1) 
it is denoted by: 
ZwyXi (i^j) = Off-diagonal elements of the outer product 

matrix. (3) 

The pre-processed input vector is a 5 5 -dimensional vector, 
ii) Quadratic terms (NL2) 

It is denoted by: Ew^ 2 = Diagonal elements of the outer 
product matrix. (4) 

The pre-processed input vector is a 1 1 -dimensional vector. 

iii) A combination of product of inputs and quadratic 
terms (NL3) 

It is denoted by: 

ZwijxXi^j) + Swy^ 2 = Diagonal elements and Off-diagonal 
elements of the outer product matrix. (5) 

The pre-processed input vector is a 66 dimensional vector. 

iv) Linear plus NL1 (NL4) 
The pre-processed input vector is a 66 dimensional vector. (6) 

v) Linear plus NL2 (NL5) 
The pre-processed input vector is a 22-dimensional vector. (7) 

vi) Linear plus NL3 (NL6) 
The pre-processed input vector is a 5 5 -dimensional vector. (8) 

In the above polynomials such as NL4, NL5 and NL6 
vector, the term 'linear' represents the normalized input 
pattern without pre-processing. When the training of the 
network is done with a fixed pre-processing of the input vector, 
the number of iterations required is less than that required for 
the training of the network without pre-processing of the input 
vector to reach the desired MSE. The combinations of 
different pre-processing methods with different synaptic 
weight update algorithms are shown in Table III. BPA weight 
update algorithms have been used with fixed pre-processed 
input vectors for learning. 

C. Back Propagation Algorithm 

A neural network is constructed by highly interconnected 
processing units (nodes or neurons) which perform simple 
mathematical operations . Neural networks are characterized 
by their topologies, weight vectors and activation function 
which are used in the hidden layers and output layer. The 
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topology refers to the number of hidden layers and connection 
between nodes in the hidden layers. The activation functions 
that can be used are sigmoid, hyperbolic tangent and sine. The 
network models can be static or dynamic . Static networks 
include single layer perceptrons and multilayer perceptrons. A 
perceptron or adaptive linear element (AD ALINE) refers to a 
computing unit. This forms the basic building block for neural 
networks. The input to a perceptron is the summation of input 
pattern vectors by weight vectors. In most of the applications 
one hidden layer is sufficient. The activation function which is 
used to train the Artificial Neural Network is the sigmoid 
function. 

1) Training 

1 . Read log files and filter it 

2. Separate the data into inputs and target 

3 . Preprocess the data to any NL 

4. Calculate Principal Component Vector by 

Z=Z*ZT (9) 

Where Z denotes the cleaned logs 

5. Train the BP A. 

5. a Forward Propagation 

(i) The weights of the network are initialized. 

(ii) The inputs and outputs of a pattern are presented to the 
network 

(iii) The output of each node in the successive layers is 
calculated. 

O (output of a node) = l/(l+exp(-lW y X,)) (10) 

(iv) The error of a pattern is calculated 



E(p) = (l/2)I(d(p)-o(p)) 2 



(11) 



5.b Backward Propagation 



(i) The error for the nodes in the output layer is calculated 
5(output layer) = o(l-o)(d-o) (12) 

(ii) Weights between output layer and hidden layer are 
updated. 

W(n+1) = W(n) + n5(output layer) o(hidden layer) (13) 

(iii) The error for the nodes in the hidden layer is calculated. 
5(hidden layer) = o(l-o) Z5(output layer) W (updated 
weights between hidden and output layer) (14) 

(iv) The weights between hidden and input layer are 
updated 

W(n+1) = W(n) + n5(hidden layer) o(input layer) (15) 

The above steps complete one weight updating. Second 
pattern is presented and the above steps are followed for the 
second weight updating. When all the training patterns are 
presented, a cycle of iteration or epoch is completed. The 
errors of all the training patterns are calculated and displayed 
on the monitor as the mean squared error (MSE). 

2) Testing 

1 . Read filtered logs and separate into inputs and target 

2. Preprocess the data with a polynomial function 

3. Process with final weights of BPA 

4. Generate the suggestions from the output layer 



5. Present the suggestions through templates 

IV. Results And Discussion 

Figure 4 presents the mean squared error and classification 
performance of BPA without preprocessing the input vectors. 
Fig. 5 to Fig. 10 presents the MSE and classification 
performance of BPA with preprocessed input vectors. The 
computational effort, Mean squared error, the iterations 
required for various algorithm are presented in Table IV. From 
the Table III , it can be noted that , the algorithm with ( BPA 
+NL2 ) requires less number of computational effort to 
achieve minimum 80% classification. 

V. Conclusion 

In this work, a preprocessing approach has been 
implemented for ANN to learn the web usage mining. The 
number of arithmetic operations required to train the network 
with a pre- processed input vector is more, indicating that the 
computational effort is more. The number of iterations 
required is less than that required for the vector without pre- 
processing. The classification performance after 
preprocessing is more than that of the network trained without 
pre-processing. The proposed method has to be tried with 
different types of web sites. 
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Figure 5 MSE and percentage of correct proposed webpage using (BPA+NL1) 
with preprocessing the input vector (Table II ) 
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ure 6. MSE and percentage of correct proposed webpage using 
(BPA+NL2) with preprocessing the input vector (Table II) 
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Figure .7 MSE and percentage of correct proposed webpage using 
(BPA+NL3) with preprocessing the input vector (Table II) 



Figure 4. MSE and percentage of correct proposed webpage using BPA 
without preprocessing the input vector (Table II) 
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Figure. 8 MSE and percentage of correct proposed webpage using (BPA+NL4) 
with preprocessing the input vector (Table II ) 
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Figure. 10 MSE and percentage of correct proposed webpage using 
(BPA+NL6) with preprocessing the input vector (Table II ) 
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Abstract: As technology improve, attackers are trying to get 
access of the network system resources by so many means, open 
loop holes in the network allow them to penetrate in the network 
more easily. Various approaches are tried for classification of 
attacks. In this paper we have compared two methods Naive 
Bayes and Junction Tree Algorithm on reduced set of features by 
improving the performance as compared to full data set. For 
feature reduction PCA is used that helped in proposing a new 
method for efficient classification. We proposed a Bayesian 
network-based model with reduced set of features for Intrusion 
Detection. Our proposed method generates a less false positive 
rate that increase the detection efficiency by reducing the 
workload and that increase the overall performance of an IDS. 
We also investigated that whether conditional independence 
really effect on the attacks/ threats detection. 



Keywords-Network Intrusion Detection 
Bayesain Networks; Junction Tree Algorithm 



System(NIDS); 



I. 



Introduction 



Network Security whether in a commercial organization or 
in a critically important research network, is a major issue of 
concern with the increasing use of web even the personal 
information in under threat. Efficient network intrusion 
detection system is only solution to such threats [4]. 

IDS is a monitoring system of networks to control / avoid / 
secure the networks from cyber terrorist or it is the process of 
examing the events occurring in a network or computer system 
and detecting the signs of incidents which are the threats of 
computer security policies. Network system monitored by the 
IDS for detection of any rules violation. Having such violation 
in the system, efficient IDS generates notification by means of 
an alarm generation that alert the administrator to put some 
steps/major according to such vulnerabilities. Common 
intrusion attacks are classified based on various features/ 
parameter. KDD-99 data set usually used for investigating the 
nature of attack. The data set has 41 features listed. Information 
value of these features and interdependence among them is an 
interest of investigation. How much reduction in features can 
be made without reducing the efficiency of classification 
algorithm and whether interdependency really contributes to 
detection efficiency? We are tried to find the answers of such 
kind of questions in this paper. PCA is an effective data 
dimension reduction technique. Similarly Nai've Bayes' 
classifier and Bayesian Network both use probabilistic 



approach for determination of attack probability. Nai've Bayes' 
classifiers assume conditional independence while Bayesian 
network consider assumes conditional dependence. Two 
methods can be used to compare whether conditional 
independency or interdependency really contribute to 
probability of attack. In the next section we discussed some 
related works which are already proposed, in section 3 we 
discussed the two methods of classification, in section 4 the 
methodology is mentioned and finally in section 5 results and 
discussions are presented. 



II. 



Background 



For intrusion most network based systems become the 
target to the hacker, so building efficient IDS is the main task 
now a day [4]. Intrusion based systems needs a component that 
generates an alerts on the basis of rule set, to detect the 
malicious activity correctly it is necessary to manage the alerts 
correctly [1]. Data Mining approaches are being applied by 
researchers for the attacks detection in their Intrusion Detection 
Systems[2].. Probabilistic approaches for reducing the false 
alarm rate are proposed for example, see [3]. The enormous 
amount of network data traffic is accumulated each day. 
Numbers of data mining approaches are used for collecting 
knowledge domain for intrusion detection which includes 
clustering, association rules and classification [12]. Data 
analysis supports by data mining techniques and now it 
becomes one of the important features/component in intrusion 
based system. The main concern of using data mining 
techniques in attacks detection system to differentiate between 
normal packet vs abnormal. For applying data mining in 
intrusion detection we need a data set and a classification 
model. That classification model may be Bayesian Network, 
neural network, rule based decision tree based and other soft 
computing techniques as Support Vector Machines(SVM) 
[10,11]. Intrusion Detection System is now becomes the 
necessicity for an organizational security system with its 
credibility that may depend upon the data mining techniques. 

2.1 Clustering 

The process of labeling data and arranging it in groups is 
called clustering. By grouping we basically improve the 
performance of different classifiers used. The genuine cluster 
contains data corresponding to single category [5]. The data set 
belongs to the cluster is modeled with respect to them exciting 
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features. You may define the term clustering in such a way that 
it refers as unsupervised machine learning mechanism for 
patterns matching in unlabeled data with numerous 
aspects. 

2.2 Classification 



Bayes classifier is compared with Junction Tree algorithm. For 
modeling Naive Bayes classifier several distribution including 
normal gamma or Poisson density function can be employed. 



3.2 



Junction Tree Algorithm 



In classification we break the data sets into different classes 
and it is much less exploratory than clustering. By means of 
classification we need to classify data into set of classes normal 
/not normal and to sub classify into different types. NaiVe 
Bayes' used as a classification algorithm in this research by 
which data classification for intrusion detection be achieved. 
Due to the collection of huge amount of data traffic needed 
classification is less famous [6]. 



III. Classification Methods 
3.1 Naive Bayes Classifier 

NaiVe Bayes classifier is an effective technique for 
classification of data. The technique is particularly useful for 
large data dimension. The NaiVe Bayes is a special case of 
Bayes theoram which presuppose independence in data 
attributes [7]. Even though Naive Bayes assumes data 
independence, its performance is efficient and at par with other 
techniques assuming data conditionality. Naive Bayes classifier 
can manage continuous or categorical data. Let for a set of 

given variable X={xi,x 2 , x n } with possible outcomes 

0={oi,o 2 , o n }. The posterior probability of the dependent 

variable is obtained by Bayes rule. 



PUcU) = 






P(Oj | x b x 2 , x n ) * P(x b x 2 , x n )Oj P(Oj) 



We can obtain a new case with X with a class label Oj have 



highest posterior probability as 



Its a graphical method of belief updation or probabilistic 
reasoning. For Probabilistic reasoning, we are using Bayesian 
Networks and Decision Graphs (BNDG) for which details can 
be found in [9]. The basic concept in junction tree is clustering 
of predicted attributes [8]. In belief updation instead of 
approximating joint probability distribution of all targeted 
variable (cliques) cluster attributes are formed and potential of 
clusters are used to approximate probability. So basically 
junction tree is the graphical representation of potential cluster 
nodes or cliques and a suitable algorithm to update this 
potential. Junction tree algorithm involve several steps as 
moralizing the graph, triangulation junction tree formulation, 
assigning probabilities to cliques, message passing and reading 
cliques marginal potentials from junction tree. 

Using Junction tree algorithm requires that directed graph 

is changed to undirected graph to ensure uniform application 
process is called moralization which involve ^adding edges 

between parents and dropping the direction let G= (Ng f Eg) 

be a directed graph to be changed into undirected graph G 
(N G ,E G ) so infect two new sets along with EG required to be 
added i.e. 

T7-* 7?-* 

* c ^ G and ^ c 
The set can be defined as 

8^6 l= 10W5) e w 3 : 3Y k e if ->■.&&) e ffgout {x p x k ) e g£j 

In moralization G G ^ G G ^ G is obtained and new 
undirected moralized graph is given as 



pCc^pCq)]^ pGUQ) 



k=I 



The efficiency of Naive Bayes classifier lies in the fact that 
it converts multi dimensionality of data to one dimensional 
density estimation. The occupations of evidence do not affect 
the posterior probability so generally classification task is 
efficient. The same is proved in this study also when Naive 



Junction tree is formed after moralization which is basically 
hyper graphs of cliques if cliques of undirected graph G is 
given by C(G) than junction tree with a unique property that 
intersections of any two nodes is contained in every node in the 
unique path joining the nodes. 

Let consider a cluster representation having to neighbor 
cluster U and V sharing a variable S in common 
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U 



V 



The aim of JTA is to modify potential in such a way that 
the distribution of P (V) is obtained by modified potential 
^(V). In such case probability of S can be given as 



P(S)=I¥(V) 




¥(U) 



Y(S)- 




¥(V) 



Similarly 



P(S) = I^(U) 



Let ¥(S) represent modified potential so ¥(S) = P(S), so 
now if potential of let say ^(V) is delayed as result of new 
evidence f the potential of both ^(S) & ^(U) can be updated 
realizing the equivalence 

¥(U) = P(S) = ¥(V) 

Belief updation in junction tree is carried out through 
message passing let U and V are two adjacent node with 
separator S. so the task is to absorb V and W through S. 
potential ¥(W) and ¥(S) with condition 

X ¥*(W) = ¥*(S) = X ¥*(V) 

In absorption ^*(S) and ^*(W) are replaced as under 
¥*(S) = X ¥(V) 



random sampling. For Naive Bayes classification two data sets 
(stratified sample of equal size of 10000) were used for 
learning and testing using software BN classifier. In junction 
tree algorithm structure learning is carried out by drawing a 
random sample of 5000 from KDD data sets using netica. Then 
five data sets each of size 1000 are selected through simple 
random sample, data set is used for learning and drawing 
junction tree. Data set 2 to 5 were used for testing belief update 
learned by junction tree. 



V. 



Results & Discussion 



The 41 features of KDD '99 data set were reduced to 14 
features. The PC A identified 12 major components having 
Eigen values greater than and around more than 80% 
variability of data explained by these features while 98% 
variability can be explained 24 components. 

The difference of variability between 24 and 14 features 
selection is only 1 8% but computational cost highly increased 
if 24 parameters are selected, so optimize the processing speed 
14 has been selected. It is evident from the graph mentioned 
above that first 24 components represent 98.866% data and 14 
components explained 80% variability which is quite sufficient, 
and work was carried out on these components only, neglecting 
the other components which seem less worthy. Besides this, 
structure learning also support selection of 14 features. The 
Bayesian network model shown in Figure 2 represents 
interdependence among various attributes. It is evident that 
mainly two factors as count & srcjbyte are effected by 
various features and in turn these two ultimately affect the 
attack types. The KDD' 99 data set classification list 18 attack 
types however normal & neptune are more frequent. 



¥*(W) = ¥ (W) 



¥(S) 



In this way belief of the whole network is updated through 
message passing. 



IV. 



Methodology 



KDD '99 data set of intrusion detection was used. PC A 
technique was used and 14 features were selected on the basis 
of analysis. Selection of data set for training and testing plays a 
vital role in accuracy of prediction. In intrusion detection 
frequency of some attacks are very large as compare to others. 
To ensure inclusion of all attacks type in learning stratified 
random sample were drawn relative to proportion of each 
attack type. This produces better result as compare to simple 
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Figure 1: Scree Plot of attributes. 
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Prediction Accuracy of Major Attack Category 
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Figure:2 Bayesian Network Model Intrusion Detection System 



BN classification also supports the importance of these two 
type normal (0.527) and neptune (0.399) in Table 1. The 
probability of features buffer overflow, imap and multihop are 
less than 0.001% and that of ftp _write, guess_pas sword and 
load_module are close to 0. It suggests that this classification 
can be merged. 
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Figure3: Prediction accuracy using BN Classifier 

Figure 4 shows majors attacks category predictions. DoS 
attacks are 99.86% detected while probe attacks about 75% 
detected. 



Figure 4: Prediction Accuracy of Major Attacks 

BN classifier learned more effectively the attack which is more 
frequent. In case of identify normal attacks it showed error rate 
of 0.8% only and identification of most frequent attack neptune 
is 6.8% refers in table 1 . 



Table 1 Accuracy of Classification^ ayesian Classifier) 



Class 


Actual 


Predicted 


Diff 


Error % 


back 


62 


62 








buffer overflow 


2 





2 


100 


guess_passwd 


3 





3 


100 


imap 


2 





2 


100 


ipsweep 


225 


284 


-59 


-26.2 


multihop 


1 





1 


100 


neptune 


2630 


2587 


43 


1.6 


nmap 


96 


35 


61 


63.5 


normal 


4271 


4287 


-16 


-0.37 


phf 


1 





1 


100 


pod 


12 





12 


100 


portsweep 


186 


219 


-33 


-17.7 


rootkit 


1 





1 


100 


satan 


219 


273 


-54 


-24.6 


smurf 


168 


180 


-12 


-7.1 


teardrop 


60 


39 


21 


35 


warezclient 


57 


34 


23 


40.35 


warezmaster 


4 





4 


100 


Total 


8000 


8000 






Tab 


LE 2. PROBABILH 


T OF ATTACK(A 


.VERAGE) 




Class 


Junction 
Tree 


Naive Bayes 
Classifier 


Diff 




back 


0.0102 


0.0086 


0.0016 




buffer overflow 


0.0008 


0.001 


-0.0002 




imap 


0.0006 


0.0005 


0.0001 




ipsweep 


0.0368 


0.0368 







multihop 


0.0002 





0.0002 




neptune 


0.3992 


0.3936 


0.0056 




nmap 


0.0176 


0.0147 


0.0029 




normal 


0.527 


0.5432 


-0.0162 




Total 


1 


1 
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Using junction tree algorithm accuracy of identification is 
utmost 98%. Junction tree also identified neptune as most 
frequent attack. Probability identified of various attacks is 
depicted in table 2. It is evident that estimation of probability 
almost equal. This has been statistically compared that there is 
no significance difference between two methods. Frequencies 
of remaining attacks are very small and their probability almost 
near to zero. 
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VI. Conclusion & Future Recommendations 

Despite the fact that Nai've Bayes classifiers assume 
conditional independence and junction tree algorithm 
parameter interdependence, even though Nai've Bayes and 
junction tree classifiers are almost equally effective. It is 
recommended that only those attacks should be considered 
which are more frequents in order to achieve better 
performance. It is also found that in selection of learning and 
testing data set appropriate sampling techniques are utilized for 
better result prediction. 
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Abstract— Image enhancement is one of the major research 
fields in image processing. In many applications such as 
medical application, military application, media etc., the 
image enhancement plays an important role. There are many 
techniques proposed by different authors in order to remove 
the noise from the image and produce the clear visual of the 
image. Also, there are many filters and image smoothing 
methods available. All these available techniques are designed 
for particular kind of noises. Recently, neural networks turn to 
be a very effective tool to support the image enhancement. 
Neural network is applied in image enhancement because it 
provides many advantages over the other techniques. Also, 
neural network can be suitable for removal of all kinds of 
noises based on its training data. This paper provides survey 
about some of the techniques applied for image enhancement. 
This survey deals with the several existing methods for image 
enhancement using neural networks. 

Keywords— Image Enhancement, Image Denoising, Neural 
Network, Image Filter, Image Restoration. 

I. Introduction 

The intention of image enhancement is to improve the 
interpretability or perception of data in images for human 
visual or to provide better input for other automated image 
processing techniques. 



cause degradation of an image and image restoration is one of 
the key fields in today's Digital Image Processing due to its 
wide area of applications. Commonly occurring degradations 
include blurring, motion and noise. Blurring can be caused 
when object in the image is outside the camera's depth of field 
sometime during the exposure, whereas motion blur can be 
caused when an object moves relative to the camera during an 
exposure. The general model for image degradation 
phenomenon is given as y = Hf + n, where y is the observed 
blurred and noisy image, f is the original image, n is additive 
random noise and H is the blurring operator. The main 
objective is to estimate the original image from the observed 
degraded image. Whatever the degraded process, image 
distortions can fall into two categories, namely, spatially 
invariant or space invariant and spatially variant or space 
variant. In a space invariant distortion all pixels have suffered 
the same form of distortion. This is generally caused by 
problems with the imaging system such as distortions in 
optical system, global lack of focus, or camera motion. In a 
space variant distortion, the degradation suffered by a pixel in 
the image depends upon its location in the image. This is 
because of internal factors, such as distortions in the optical 
system, or by external factors, such as object motion. This 
survey provides many techniques available for image 
enhancement. 

II. Literature Survey 



Image enhancement methods can be broadly divided into two 
categories: 

• Spatial domain methods, which involves direct 
operation on image pixels, and 

• Frequency domain methods, which involves Fourier 
transform of an image for its operation. 

Regrettably, there is no general theory for determining what 
good image enhancement is when it comes to human 
perception. If it looks good, it is good! However, when image 
enhancement methods are used as pre-processing tools for 
other image processing methods, then quantitative measures 
can decide which techniques are most suitable. 

Image Restoration is the technique of retaining the original 
image from the degraded image given the knowledge of the 
degrading factors. There are a variety of reasons that could 



Uma et al, [1] proposed a Morphological Neural Network for 
color image restoration. This paper considers the problem of 
color image restoration degraded by a blur function and 
corrupted by random noise. A new approach developed by 
multilayer morphological (MLM) neural network is presented, 
which uses highly nonlinear morphological neuron for image 
processing to get a high quality restored color image. In this 
paper color images are considered into RGB distribution. Then 
each subspace can be considered as a gray image space and is 
processed by morphological way used in gray images. This 
method is advantageous because of its low computational 
overhead, improved performance in terms of signal to noise 
ratio with less number of neurons. 

Gallo et al, [2] presented an adaptive image restoration using 
local neural approach. This work aims at usage of neural 
learning for defining and experimentally evaluating an 
iterative strategy for blind image restoration in the presence of 
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blur and noise. A salient aspect of the solution is the local 
estimation of the restored image based on gradient descent 
strategies able to estimate both the blurring function and the 
regularized terms adaptively. As an alternative of explicitly 
defining the values of local regularization parameters through 
predefined functions, an adaptive learning approach is 
proposed. 

The various restoration techniques used currently can be 
broadly viewed under two categories, namely, the transform 
related techniques and the algebraic restoration techniques [3]. 
The transform related techniques involve analyzing the 
degraded image after an appropriate transform has been 
applied. The two popular transform related techniques are 
inverse filtering and Kalman filtering [4]. Inverse filtering 
produces a perfect restoration in the absence of noise, but the 
presence of noise has very bad effects. The Kalman filter 
approach can be applied to non stationary image but it is 
computationally very intensive. 

Algebraic techniques attempt to find a direct solution to the 
distortion by matrix inversion techniques, or techniques 
involving an iterative method to minimize a degradation 
measure. The two popular algebraic techniques available are 
pseudo inverse filtering and constrained image restoration. 
The pseudo inverse spatial image restoration techniques 
attempt to restore an image by considering the vector space 
model of the image degradation and attempting to restore the 
image in this vector space domain. This method does not 
consider the effects of noise in the calculations of the pseudo 
inverse and so is sensitive to noise in the image. This involves 
determining an approximation to the inverse of the matrix 
blurring operator which is multiplied with the column scanned 
image vector to produce the degraded image. Blur matrices are 
very large and it is not computationally feasible to invert them. 
Constrained restoration techniques are often based on Wiener 
estimation and regression techniques. One of the major 
drawbacks in most of the image restoration algorithms is the 
computational complexity, so various simplifying assumptions 
have been made to obtain computationally feasible algorithms. 

Motivated by the biological neural network in which the 
processing power lies in a large number of neurons linked 
with synaptic weights, artificial neural network models 
attempt to achieve a good performance via dense 
interconnection of simple computational elements. Neural 
network models have great potential in areas where high 
computation rates are required and the current best systems are 
far from equaling human performance. Restoration of a high 
quality image from a degraded recording is a good application 
area of neural nets. Joon et al, [5] proposed a Modified 
Hopfield neural network model for solving the restoration 
problem which improves upon the algorithm proposed by 
Zhou et al. [6]. 

Osman et al, [7] gives an image enhancement using bright 
and dark stretching techniques for tissue based tuberculosis 
bacilli detection. This paper proposes two methods for color 
image enhancement; bright stretching and dark stretching 
algorithms. Both techniques are well known to create good 



image enhancement for gray scale images. But, the current 
study has adapted these techniques to be used for color 
images. Even though the adapted image processing method is 
quite simple, the results signify that these methods may have 
some potential to be used for improving the quality of Ziehl 
Neelsen slide images. The experimental result illustrates that 
both methods proposed by the author can improve the image 
contrast and enhances the image quality when compared to its 
conventional techniques. 

Pattern learning based image restoration using neural networks 
is put forth by Dillon et al, [8]. The author illustrate a generic 
pattern learning based image restoration scheme for degraded 
digital images, where a feed-forward neural network is 
employed for implementation of the proposed techniques. The 
methodology reported here can be applied in several 
circumstances, for instance, quality enhancement as a post- 
processing of image compression schemes, blur image 
restoration and noise image filter, provided that the training 
data set is comprised of patterns rich enough for supervised 
learning. This paper focuses on the problem of coded image 
restoration. The key points addressed in this work are 

• The use of edge data extracted from source image as 
a priori knowledge in the regularization function to 
get better details and reduce the ringing artifact of the 
coded images. 

• The theoretic basis of the pattern learning-based 
technique using implicit function theorem. 

• Subjective quality improvement with the use of an 
image similarity for training neural networks 

• Empirical studies with contrast to the set partitioning 
in hierarchical tree (SPIHT) method. 

The main advantages of this model-based neural image 
restoration approach comprise strong robustness with respect 
to transmission noise and the parallel processing for real-time 
applications. 

Reeves [9] described fast and direct image restoration with 
edge-preserving regularization. In several applications, fast 
restorations are required to keep up with the frame rate. FFT- 
based restoration affords a fast implementation, but it does so 
at the expense of assuming that the degree of regularization is 
constant over the image. Unfortunately, this hypothesis can 
generate significant ringing artifacts in the presence of edges 
as well as edges that are blurrier than necessary. Shift-variant 
regularization affords a way to vary the roughness penalty as a 
function of spatial coordinates. Virtually all edge-preserving 
regularization techniques exploit this concept. However, this 
technique destroys the structure that makes the use of the FFT 
possible, since the deblurring operation is no longer shift- 
invariant. Thus, the restoration techniques available for this 
problem no longer have the computational efficiency of the 
FFT. The author proposes a new restoration method for the 
shift- variant regularization approach that can be implemented 
in a fast and flexible manner. This paper decomposes the 
restoration into a sum of two independent restorations. One 
restoration yields an image that comes directly from an FFT- 
based approach. This image is a shift-invariant restoration 
consisting of usual artifacts. The other restoration involves a 
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set of unknowns whose number equals the number of pixels 
with a local smoothing penalty significantly different from the 
typical value in the image. This restoration represents the 
artifact correction image. By summing the two, the artifacts 
are canceled. Since the second restoration has a significantly 
reduced set of unknowns, it can be calculated very efficiently 
even though no circular convolution structure exists. 

Noise-refined image enhancement using multi-objective 
optimization is illustrated by Peng et al, [10]. This paper 
presents a novel scheme for the enhancement of images using 
stochastic resonance (SR) noise. In this scheme, a suitable 
dose of noise is added to the lower quality images such that 
the performance of a suboptimal image enhancer is improved 
without altering its parameters. In this paper, image 
enhancement is modeled as a constrained multi-objective 
optimization (MOO) problem, with similarity and some 
desired image enhancement characteristic being the two 
objective functions. The principle of SR noise-refined image 
enhancement is analyzed, and an image enhancement system 
is developed. A genetic algorithm-based MOO technique is 
employed to find the optimum parameters of the SR noise 
distribution. In addition, a novel image quality evaluation 
metric based on human visual system (HVS) is developed as 
one of the objective functions to guide the MOO search 
procedure. 

Lu et al, [11] proposed an image noise reduction technique 
based on the fuzzy rules. Considering the image as non- 
stationary signal, an image noise reduction method based on 
the fuzzy rules is proposed. This image processing system 
(IPS) is recognized as a time- variant system in which the 
system parameters change continuously based on the local 
characteristics of the images. For the purpose of noise 
reduction, Gaussian noise is considered here. The fuzzy rules 
are implemented to consider the unstableness and uncertainty 
of signals. The nonlinear function indicating the fuzzy rule- 
based IPS depends on the rules concerning the local 
characteristics of the input, on the membership functions, and 
on the used defuzzification method. For making the system 
performance as high as possible, these factors must be agreed 
to be the most appropriate ones. In this paper a technique for 
designing the optimum nonlinear function directly from the 
local characteristics of training data is presented. Here the 
rules, the membership functions, and the technique of 
defuzzification are not essential to be known. The design of 
these factors is concerned in the design of the membership 
function, thus attaining the optimum nonlinear function is 
sufficient for designing the IPS. The only thing required to do 
is to choose what sort of the local characteristics of the image 
should be applied to the rule-based system. Computer 
simulations illustrate that the proposed technique gives better 
results in comparison with that of the weighted averaging filter 
and median filter. 

An adaptive fuzzy image enhancement algorithm for local 
regions is given by Yan et al, [12]. To overcome the 
drawbacks of low speed and losing image information in fuzzy 
image enhancement algorithms, a novel fuzzy enhancement 
operator with close-character and transplantable-character is 



proposed in this paper. The approached operator utilizes the 
gradient operator to create the image enhancement processing 
focus on the interested regions, and the OTSU operator to 
automatically select the best threshold value, which can realize 
a novel adaptive fuzzy image enhancement algorithm for local 
regions. Through the experimentations of the asphalt 
pavement crack image detection system, the experimental 
results specify that the novel algorithm can not only attain 
better processing effects and higher processing speed than 
now-available fuzzy image enhancement algorithms, but also 
possess the property of high practicability and generality. 

Faouzi et al, [13] provides a directional-rational approach for 
color image enhancement. In this, the author presents an 
unsharp masking-based approach for noise smoothing and 
edge enhancing in multichannel images. The structure 
presented by author is similar to the conventional unsharp 
masking structure, however, the enhancement is allowed only 
in the direction of maximal change and the enhancement 
parameter is computed as a nonlinear function of the rate of 
change. This scheme improves the true details, limits the 
overshoot near sharp edges and attenuates noise in flat areas. 
In addition the use of the control function eliminates the need 
for the subjective coefficient X used in the conventional 
unsharp masking method. 

The noise reduction based on fuzzy image filtering is put forth 
by Dimitri et al, [14]. A new fuzzy filter is provided for the 
noise reduction of images corrupted with additive noise. The 
filter involves two stages. The initial stage calculates a fuzzy 
derivative for eight different directions. The next stage uses 
these fuzzy derivatives to carry out fuzzy smoothing by 
weighting the contributions of neighboring pixel values. Both 
these stages are dependent on fuzzy rules which make use of 
membership functions. The filter can be implemented 
iteratively to effectively decrease heavy noise. Especially, the 
shape of the membership functions is adapted according to the 
remaining noise level after each iteration; making use of the 
distribution of the homogeneity in the image. A statistical 
technique for the noise distribution can be included to relate 
the homogeneity to the adaptation scheme of the membership 
functions. 

Gacsadi et al, [15] makes use of cellular neural network for 
the purpose of image enhancement. This technique takes both 
the denoising and the increase of the contrast into 
consideration. Due to whole parallel processing, computing- 
time reduction is achieved. In the enhancement process by 
usage of nonlinear and feedback template local and also 
regional properties will be taken into consideration due to the 
propagation of the effect between the neighbors. Considerable 
computing power is required to solve the image processing 
task described by variational computing. The Cellular Neural 
Networks (CNN) proved to be very useful regarding real-time 
image processing. The reduction of computing time, due to 
parallel processing, can be obtained only if the processing 
algorithm can be implemented on a CNNUC or by using 
emulated digital CNN-UM implemented on FPGAs. 
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The traditional analog architectures of CNN-UC are superior 
in terms of processing speed and power dissipation. However, 
these implementations have a restricted applicability due to 
their limited features regarding accuracy, flexibility, small 
number of cells, their high cost and the lengthy period needed 
for the development of such a chip. On the other hand, while 
software solutions are extremely flexible, they are sometimes 
inefficient because of limited and low processing speed. 
Today, the version of CNN digital emulator implemented 
FPGA is a solution that achieved a compromise between speed 
and accuracy, but ensure repeatability, reproducibility, 
flexibility, possibility for CNN implementation even for 
complex processes of processing and easy interfacing with 
digital systems. In this sense, the CNN digital emulator 
implemented FPGA maximizes, for a concrete application, the 
performances of the overall CNN processing. 

There are two basic approaches to image denoising [16] - 
spatial filtering methods and transform domain filtering 
methods. 

Spatial Filtering 

A conventional way to remove noise from image data is to 
employ spatial filters. Spatial filters can be further classified 
into non-linear and linear filters. 

i. Non-Linear Filters 
With non-linear filters, the noise is removed without any 
attempts to explicitly identify it. Spatial filters utilize a low 
pass filtering on groups of pixels with the assumption that the 
noise occupies the higher region of frequency spectrum. 
Generally spatial filters remove noise to a reasonable extent 
but at the cost of blurring images which in turn makes the 
edges in pictures invisible. In recent years, a variety of 
nonlinear median type filters such as weighted median, rank 
conditioned rank selection, and relaxed median have been 
developed to overcome this drawback. 

ih Linear Filters 
A mean filter is the optimal linear filter for Gaussian noise in 
the sense of mean square error. Linear filters also tend to blur 
sharp edges, destroy lines and other fine image details, and 
perform poorly in the presence of signal-dependent noise. The 
wiener filtering method requires the information about the 
spectra of the noise and the original signal and it works well 
only if the underlying signal is smooth. Wiener method 
implements spatial smoothing and its model complexity 
control correspond to choosing the window size. 

Sarode et ah, [17] proposed the color image enhancement with 
the help of fuzzy system. This technique involves the use of 
knowledge-base (fuzzy expert) systems that are capable of 
mimicking the behavior of a human expert. Fuzzy technique 
of knowing severity of tumor is essential to determine if there 
is a need for the biopsy and it gives to user a clear idea of 
spread and severity level of tumor. Fuzzy based improvement 
of color feature of tumor is an application of fuzzy in the area 
of color feature extraction for enhancement of a peculiar 
feature. It has been determined that RGB color model is not 



appropriate for enhancement because the color components 
are not decoupled. Alternatively, in HSV color model, hue 
(H), the color content, is separate from saturation (S), which 
can be used to dilute the color content and V, the intensity of 
the color content. By conserving H, and modifying only S and 
V, it is likely to enhance color image. Therefore, it is required 
to convert RGB into HSV for the purpose. A Gaussian type 
membership function is utilized to model S and V property of 
the image. This membership function utilizes only one 
fuzzifier and is evaluated by maximizing fuzzy contrast. 

Muthu Selvi et ah, [18] put forth a hybrid image enhancement 
technique for noisy dim images using curvelet and 
morphology techniques. The noisy dim images degrade the 
image quality. The denoising method using curvelet transform 
outperforms than wavelet transform. The noisy dim image is 
made noise free with the help of curvelet transform to the dim 
image for avoiding the over illumination and under 
illumination problems. Next the dim image is enhanced using 
the morphological transformations. Closing by reconstruction 
is implemented to identify the background of the dim image. 
The experimental result shows that the morphological 
restoration filter with closing by reconstruction produces 
better result than opening by reconstruction. 

Hassan et ah, [19] presented a contrast enhancement technique 
for dark blurred images. The chief goal presented by the 
author is to produce a contrast enhancement technique to 
recover an image within a given area, from a blurred and 
darkness specimen, also improve visual quality of it. This 
method consists of two steps unsharp masking step and 
contrast enhancement step. The unsharp masking step is 
applied to the image to sharpen edges and bring out hidden 
details. On the contrary enhancement step 3x3 slider map 
window was applied to the image to determine if the 
corresponding pixel will be remapped or not. The new value 
of remapped pixel obtained is based on a sigmoid map 
function. Good and satisfying results were obtained by 
experimentation on this technique. 

Gilbao et ah, [20] uses the complex diffusion processes for 
image enhancement and denoising. The linear and nonlinear 
scale spaces, obtained by the inherently real- valued diffusion 
equation, are generalized to complex diffusion processes, by 
incorporating the free Schrodinger equation. A basic solution 
for the linear case of the complex diffusion equation is 
developed. Investigation of its performance shows that the 
generalized diffusion process combines properties of both 
forward and inverse diffusion. It verifies that the imaginary 
part is a smoothed second derivative, scaled by time, when the 
complex diffusion coefficient approaches the real axis. Based 
on this observation, the authors develop two examples of 
nonlinear complex processes, useful in image processing: a 
regularized shock filter for image enhancement and a ramp 
preserving denoising process. 

Image denoising using non-negative sparse coding shrinkage 
algorithm is given by Shang et ah, [21]. The author proposes a 
new method for denoising natural images using this extended 
non-negative sparse coding (NNSC) neural network shrinkage 
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algorithm, which is self-adaptive to the statistic property of 
natural images. The fundamental principle of denoising using 
NNSC shrinkage is similar to that using standard sparse 
shrinkage and wavelet soft threshold. Using test images 
corrupted by additive Gaussian noise, this paper evaluated the 
method across a range of noise levels. This method utilized the 
normalized mean squared error as a measure of the quality of 
denoising images and the signal to noise rate (SNR) value as 
an evaluative feature of different denoising approaches. The 
experimental result shows that the NNSC shrinkage certainly 
is effective in image denoising. Otherwise, the author also 
compares the effectiveness of the NNSC shrinkage with sparse 
coding shrinkage and wavelet soft threshold method. The 
simulative tests show that this denoising method outperforms 
any other of the two kinds of denoising approaches. 

Gupta et al., [22] designed a FIR filter for image restoration 
using principal component neural network. The neural 
network can be applied in many image denoising applications 
because of its inherent characteristics such as nonlinear 
mapping and self-adaptiveness. The design of filters widely 
depends on the a-priori knowledge about the type of noise. 
Because of this, standard filters are application and image 
specific. Extensively used filtering algorithms reduce noisy 
artifacts by smoothing. Though, this operation normally 
results in smoothing of the edges as well. Alternatively, 
sharpening filters enhance the high frequency details making 
the image non-smooth. An integrated general technique to 
design a finite impulse response filter based on principal 
component neural network (PCNN) is provided in this study 
for image filtering, optimized in the sense of visual inspection 
and error metric. This technique utilizes the inter-pixel 
correlation by iteratively updating the filter coefficients using 
PCNN. This technique performs optimal smoothing of the 
noisy image by preserving high and low frequency features. 
Experimental results state that this filter is robust under 
various noise distributions. Additionally, the number of 
unknown parameters is very few and most of these parameters 
are adaptively obtained from the processed image. 



proposed method offering the best efficiency in terms of 
image denoising and edge preservation. 

Zhang et al, [25] presented image denoising using a neural 
network based non-linear filter in wavelet domain. Images are 
often distorted as a result of various factors that can occur 
during acquisition and transmission processes. Image 
denoising is intended at removing or reducing noise, so that a 
good-quality image can be obtained for various applications. 
The author presents a neural network based denoising method 
implemented in the wavelet transform domain. A noisy image 
is first wavelet transformed into four subbands, and then a 
trained layered neural network is applied to each subband to 
generate noise-removed wavelet coefficients from their noisy 
ones. The denoised image is then obtained through the inverse 
transform on the noise-removed wavelet coefficients. 
Compared with other techniques performed in the wavelet 
domain, it requires no a priori knowledge about the noise and 
needs only one level of signal decomposition to obtain very 
good denoising results. 

III. Conclusion 

This survey discusses about several existing image 
enhancement methods. All those methods discussed have their 
own advantages and disadvantages. This survey helps in 
choosing the better suitable image enhancement scheme for 
particular kind of noise in the image. Also, the filtering 
algorithms used for removing the noise from the image are 
presented in this thesis. This survey explains about the 
importance of neural network for image enhancement as the 
neural networks have the advantages such as nonlinear 
mapping and self-adaptiveness. To overcome the demerit of 
the techniques discussed in this survey, Adaptive Neuro-Fuzzy 
Interference Systems (ANFIS) can be used for image 
enhancement as it combines the advantages of Artificial 
Neural Networks (ANN) and Fuzzy Interference System 
(FIS). 



Image denoising based on combined neural networks filter is 
proposed Junhong et al. ,[23]. A new image restoration 
technique based on combined neural networks Alter is 
proposed by the author. This integrated neural networks Alter 
is posed by a BPNN Alter and an image data fusion system 
based on self-organizing mapping neural networks. And this 
technique can use the corrupted image itself as training data to 
avoid the problem of how to choose the training data, which is 
most of the other neural networks denoising methods have to 
face, by using the distributed character of WGN. Experiment 
results show that this method can denoise the noises 
effectively. 

Gacsadi et al, [24] describes PDE-based medical images 
denoising using Cellular Neural Networks. The author 
presents the medical image denoising by using cellular neural 
networks (CNN), based on the variational model of Chan and 
Esedoglu. By comparatively examining the proposed method 
and other CNN methods that uses variational computation, the 
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Abstract — With the persistent growth of the World Wide Web, 
the difficulty is increased in the retrieval of relevant information for a 
user's query. Present search engines offer the user with several web 
pages, but different levels of relevancy. To overcome this, the 
Semantic Web has been proposed by various authors to retrieve and 
utilize additional semantic information from the web. As the 
Semantic Web adds importance for sharing knowledge on the internet 
this has guide to the development and publishing of several 
ontologies in different domains. Using the database terminology, it 
can be said that the web-ontology of a semantic web system is 
schema of that system. As web ontology is an integral aspect of 
semantic web systems, hence, design quality of a semantic web 
system can be deliberated by measuring the quality of its web- 
ontology. This survey focuses on developing good ontologies. This 
survey draws upon semiotic theory to develop a suite of metrics that 
assess the syntactic, semantic, pragmatic, and social aspects of 
ontology quality. This research deliberates about the metrics that may 
contribute in developing a high quality semantic web system. 



Keywords — Quality Metrics, Web ontology, Semiotic Metrics, 
Semantic Quality, Domain modularity. 



II. 



Literature Survey 



I. 



Introduction 



SEMANTIC Web is nothing but the extension of the 
present web in which the web resources are prepared with 
formal semantics about their interpretation for the machines. 
These web resources are combined in the form of web 
information systems, and their formal semantics are usually 
characterized in the form of web -ontologies. By means of the 
database terminology, it can be said that the web-ontology of a 
semantic web system is representation of that system [11]. 
Design quality of a semantic web system can be calculated by 
computing the quality of its web-ontology because web 
ontology is the integral element of semantic web systems [25]. 
The main concern is that when the design of a web-ontology is 
completed, it is suitable time to assess its quality so that in 
case, the design is of low quality, it can be enhanced before its 
instantiation. This helps in saving of considerable amount of 
cost and effort for developing high quality semantic web 
systems. Metrics are considered as the appropriate tools for 
estimating quality. This survey focuses on several metrics for 
web ontology quality evaluation. 



Ahluwalia et al, [1] presented a Semiotic Metrics Suite for 
Assessing the Quality of Ontologies. Table 1 shows some of 
the metrics for quality evaluation [1,3]. 

As a decisive construct, overall quality (Q) is a subjective 
function of its syntactic (S), semantic (E), pragmatic (P), and 
social (O) qualities [1] (i.e., Q = blxS + b2xE + b3xP + 
b4xO). The addition of weight is equal to 1. In the absence of 
pre-specified weights, the weights are assigned to be equal. 

Syntactic Quality (S) evaluates the quality of the ontology 
according to the way it is written. Lawfulness is the extent to 
which an ontology language's rules have been obeyed. Not 
every ontology editors have error-checking capabilities; 
however, without correct syntax, the ontology cannot be read 
and used. Richness is nothing but the proportion of features in 
the ontology language that have been used in ontology (e.g., 
whether it includes terms and axioms, or only terms). Richer 
ontologies are more valuable to the user (e.g., agent). 

Semantic Quality (E) estimates the meaning of terms in the 
ontology library. Three attributes are used here are 
interpretability, consistency, and clarity. Interpretability deals 
with the meaning of terms (e.g., classes and properties) in the 
ontology. In the real world, the knowledge provided by the 
ontology can map into meaningful concepts. This is 
accomplished by checking that the words used by the ontology 
be present in another independent semantic source, such as a 
domain- specific lexical database or a comprehensive, generic 
lexical database such as WordNet. Consistency is nothing but 
whether terms having a consistent meaning in the ontology. 
For example, if an ontology claims that X is a subclass of Y, 
and that Y is a property of X, then X and Y have incoherent 
meanings and are of no semantic value. For example, 
ontological terms such as IS -A is often used inconsistently. 
Clarity is the term which determines whether the context of 
terms is clear. For example, if ontology claims that class 
"Chair" has the property "Salary," an agent must know that 
this illustrate academics, not furniture. 

Pragmatic Quality (P) deals with the ontology's usefulness 
for users or their agents, irrespective of syntax or semantics. 
Three criteria are used for determining P. Accuracy is whether 
the claims on ontology makes are 'true.' This is very tricky to 
determine automatically without a learning mechanism or 
truth maintenance system. Currently, a domain expert 
evaluates accuracy. The measure of the size of the ontology is 
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called as Comprehensiveness. Larger ontologies are more whether the 
probable to be complete representations of their domains, and requirements, 
provide more knowledge to the agent. Relevance indicates 



ontology satisfies the agent's specific 



Table 1: Determination of Metric Values 



Attributes 


Determination 


Overall Quality (Q) 


Q = bl.S + b2.E + b3.P + b4.0 


Syntactic Quality (S) 


S = bsl.SL + bs2.SR 


Lawfulness (SL) 


Let X be total syntactical rules. Let X b be total breached rules. Let NS 
be the number of statements in the ontology. Then SL = X b / NS. 


Richness (SR) 


Let Y be the total syntactical features available in ontology language. 

Let Z be the total syntactical features used in this ontology. 

Then SR = Z/Y. 


Semantic Quality (E) 


E = bel.EI + be2.EC + be3.EA 


Interpretability (EI) 


Let C be the total number of terms used to define classes and properties 

in ontology. 
Let W be the number of terms that have a sense listed in WordNet. Then 

EI = W/C. 


Consistency (EC) 


Let 1 = 0. Let C be the number of classes and properties in ontology. 

VCi, if meaning in ontology is inconsistent, 1+1. Therefore, I = number 

of terms with inconsistent meaning. Ec = I/C. 


Clarity (EA) 


Let Ci = name of class or property in ontology. V Ci, count Ai, (the 
number of word senses for that term in WordNet). Then EA = A/C. 


Pragmatic Quality (P) 


P = bpl.PO + bp2.PU + bp3.PR 


Comprehensiveness (PO) 


Let C be the total number of classes and properties in ontology. Let V 
be the average value for C across entire library. Then PO = C/V. 


Accuracy (PU) 


Let NS be the number of statements in ontology. Let F be the number of 

false statements. PU = F/NS. Requires evaluation by domain expert 

and/or truth maintenance system. 


Relevance (PR) 


Let NS be the number of statements in the ontology. Let S be the type of 

syntax relevant to agent. Let R be the number of statements within NS 

thatuseS.PR = R/NS. 


Social Quality (0) 


= bol.OT + bo2.0H 


Authority (OT) 


Let an ontology in the library be OA. Let the set of other ontologies in 

the library be L. Let the total number of links from ontologies in L to 

OA be K. Let the average value for K across ontology library be V. 

ThenOT = K/V. 


History (OH) 


Let the total number of accesses to an ontology be A. Let the average 
value for A across ontology library be H. Then OH = A/H. 


Cohesion (Coh) 


Coh=ISCCI 
Where SCC is separate connected components 


Fullness (F) 


IQCOI 

|C\(/)| 


Readability (Rd) 


Rd = \A,A = rdfs: comment\ + \A,A = rdfs: label\ 



For the purpose of evaluation, it needs some knowledge of 
the agent's requirements. This metric is coarse as it verifies for 



the type of information the agent uses by ontology (e.g., 
property, subclass, etc), rather than the semantics needed for 
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specific tasks (e.g., the particular subclasses needed to 
interpret a user's specific query). 

Social quality (O) imitates the fact that agents and 
ontologies exist in communities. The authority of an ontology 
is nothing but the number of other ontologies that link to it 
(define their terms using its definitions). More authoritative 
ontologies indicate that the knowledge they provide is 
accurate or useful. The history indicates the number of times 
the ontology is accessed. Ontologies are more dependable 
when they are with longer histories. 

The cohesion (Coh) of a KB is nothing but the number of 
separate connected components (SCC) of the graph 
representing the KB . 

The fullness (F) of a class Q is defined as the actual number 
of instances that belong to the subtree rooted at Q (Q(I)) 
compared to the expected number of instances that belong to 
the subtree rooted at Q (Q V (I)). 

The readability (Rd) of a class Q is defined as the total of 
the number attributes that are comments and the number of 
attributes that are labels the class has. 

Amjad et al, [2] provided the Web-Ontology Design 
Quality Metrics. The author proposes design metrics for web- 
ontology [21] by maintaining certain recommended principles 
like a metric may reach its highest value for perfect quality for 
excellent case and vice versa that is it may reach its lowest 
level for worst case. It is supposed to be monotonic, clear, and 
intuitive. It must correlate well with human decisions and it 
should be automated if possible. The proposed metrics may 
give notification about how much knowledge can be derived 
from a given webontology; how much it is relevant to a user's 
specific necessities and how much it is effortless to reuse, 
manage, trace and adapt. The metrics provided by the author 
are Knowledge Enriched (KnE), Characteristics Relevancy 
(ChR) and Domains modularity (DoM). 

Knowledge Enriched metric 

The reasoning capability of a web-ontology is determined 
by Knowledge Enriched (KnE) metric, and it is based on two 
sub-metrics so-called Isolated Axiom Enriched (IAE) metric 
and Overlapped Axiom Enriched (OAE) metric. There are 
three parts in this axiom namely, predicate, resource and 
object. If none of these is similar with any other axiom of 
identical domain then that axiom is termed as isolated axiom. 
If the two axioms have some similar parts, it is said to be 
overlapped. There may be more than a few transitively 
overlapped axioms in any domain. This metric determines the 
percentage of IAE and OAE, and if the former is greater than 
the later one, then the web-ontology can be regarded as less 
knowledge enriched. IAE is officially defined as the ratio of 
total number of isolated axioms (tIAs) to the total number of 
domain axioms (tDAs). 



IAE 



-x 



tIAs 



\tDAs 

i = i 

for alll < i <n 



(1) 



In the above equation, n is total number of sub-domains of 
web-ontology. Similarly, the OAE metric is officially defined 
as ratio of total number of overlapped axioms (tOAs) to the 
total number of domain axioms. It can be written as follows: 



OAE 



ZtOA { 
tDAs 

i = i 

for alll < i < n 



(2) 



In the equation given above, n is total number of sub- 
domains of web-ontology. Lastly, the KnE metric is the 
difference of total number of overlapped axioms and the total 
number of isolated axioms. It may be written as follows: 



KnE = OAE -IAE 



(3) 



If the resultant KnE value is positive, then the web-ontology 
is more knowledge enriched, if it is zero, then the web- 
ontology is average knowledge enriched, and if it is negative, 
then the web-ontology is less knowledge enriched. 

Characteristics Relevancy metric 

Characteristics Relevancy (ChR) metric gives us the 
suggestion about how much a given web-ontology is close to a 
user's specific necessities and the degree of reusability of the 
web -ontology. Formally, it is termed as the ratio of the 
number of relevant attributes (nRAs) in a class to the total 
number of attributes (TnAs) of that class. It can be written as 
follows: 



ChR 



i 



nRAs 
TnAs 



for all 1 < i <n 



(4) 



where n in above equation represents the total number of 
classes in the provided web-ontology. ChR metric reveals the 
proportion of relevant attributes in the web-ontology, and this 
number gives insights how much a web-ontology is relevant. 

Domain Modularity metric 

Domain modularity (DoM) metric denotes the component- 
orientation feature of a web-ontology. This metric specifies 
the grouping of knowledge in different components of web- 
ontology. The webontology is best manageable, traceable, 
reusable and adaptable, if it is designed in components 
(subdomains). Formally, the DoM metric is given as the 
number of sub-domains (NSD) contained in a webontology. 
This metric also depends on the coupling and cohesion [25] 
levels of sub-domains, and it is directly proportional to its 
cohesion level and inversely proportional to its coupling level. 
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DoM = NSD + 



ZDCohi + l/V DCoupt 
' h (5) 

7-1 for alll<i<n 



In the above equation, DCoh indicates the level of domain 
cohesion and DCoup represents the level of coupling among 
sub-domains of web-ontology domain. DoM metric is a real 
number indicating the degree of partial reusability of a given 
web-ontology. 

Samir et ah, [3] given the OntoQA: Metric -Based Ontology 
Quality Analysis. The metrics presented can highlight key 
characteristics of an ontology schema and also its population 
and facilitate users to make an informed judgment easily. The 
metrics used by the author here are not 'gold standard' 
measures of ontologies. Instead, the metrics are projected to 
estimate several aspects of ontologies and their potential for 
knowledge representation. Rather than describing ontology as 
merely effective or ineffective, metrics describe a certain 
aspect of the ontology because, in most cases, the way the 
ontology is built is largely dependent on the domain in which 
it is designed. The metrics defined here are Schema Metrics 
and Instance Metrics. The following are metrics considered by 
the author: 

The following are some of Schema Metrics: 

Relationship Richness: The diversity of relations and 
placement of relations in the ontology is defined by this 
metrics. An ontology that has many relations further than 
class-subclass relations is better than taxonomy with no more 
than class-subclass relationships. The relationship richness 
(RR) is defined as the ratio of the number of relationships (P) 
defined in the schema to the sum of the number of subclasses 
(SC) plus the number of relationships. 



RR 



\P\ 



\SC\ + \P\ 



Attribute Richness: The attribute richness (AR) is defined as 
the average number of attributes (slots) per class. It is given as 
the ratio of number attributes for all classes (att) to the number 
of classes (C). 

\att\ 

Inheritance Richness: The inheritance richness of the 
schema (IRs) is defined as the average number of subclasses 
per class. The number of subclasses (CI) for a class Ci is 
definedaslHC(Cl,Ci)l. 

Zc,-Ecltf C (Ci,Q)l 



IR S = 



Class Richness: The class richness (CR) of a knowledge 
base is defined as the ratio of the number of classes used in the 
base (C v ) to the number of classes defined in the ontology 
schema (C). 



CR = 



in 

|C| 



Average Population: Formally, the average population (P) 
of classes in a knowledge base is defined as the number of 
instances of the knowledge base (I) to the number of classes 
defined in the ontology schema (C). 



|C| 



Importance: The importance (Imp) of a class Ci is defined 
as the number of instances that belong to the subtree rooted at 
Ci in the knowledge base (Ci(I)) compared to the total number 
of instances in the knowledge base (I). 



Imp = 



10(01 

l/l 



The following are some of Instance Metrics: 



Werner [4] provided a Realism-Based Metric for Quality 
Assurance in Ontology Matching. There are three levels 
introduced to the methodology for the measurement of quality 
improvements in single ontologies. These levels are: 

• Level I: reality, consisting of both instances and 
universals and also the various relations that acquire 
between them; 

• Level 2: the cognitive representations of this reality 
personified in observations and interpretations; 

• Level 3: the publicly accessible concretizations of the 
cognitive representations in representational artifacts of a 
range of sorts, of which ontologies are examples. 

Harith et al, [5] defined the metrics for Ranking 
Ontologies. In this paper AKTiveRank, a prototype system for 
ranking ontologies is proposed based on the analysis of their 
structures. This paper describes the metrics used in the ranking 
system. The ranking measures used are described below: 

Class Match Measure 

The Class Match Measure (CMM) is intended to estimate 
the coverage of ontology for the provided search terms. 
AKTiveRank looks for classes in every ontology that have 
labels matching a search term either exactly (class label 
identical to search term) or partially (class label "contains" the 
search term). 

Density Measure 

Density Measure (DEM) is deliberated to approximate the 
representational-density or information-content of classes and 
accordingly the level of knowledge detail. DEM considers 
how well the concept is additionally specified (the number of 
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subclasses), the number of attributes related with that concept, 
number of siblings, etc. 

Semantic Similarity Measure 

Similarity measures have often been used in information 
retrieval systems to afford enhanced ranking for query results. 
Ontologies can be analyzed as semantic graphs of concepts 
and relations, and hence similarity measures can be applied to 
explore these conceptual graphs. This helps in resolving 
ambiguities. 

Henry [7, 23] described a Measurement Ontology 
Generalizable for Emerging Domain Applications on the 
Semantic Web. The semantic Web is considered as the next 
generation Web of structured data that are automatically 
shared by software agents, which apply definitions and 
constraints structured in ontologies to correctly process data 
from contrasting sources. One aspect needed to develop 
semantic Web ontologies of emerging domains is creating 
ontologies of concepts that are common to those domains. 
These general ontologies can be used as building blocks to 
develop more domain-specific ontologies. However most 
measurement ontologies focus on representing units of 
measurement and quantities, and not on other measurement 
concepts such as sampling, mean values, and evaluations of 
quality based on measurements. In this paper, the author 
elaborates on a measurement ontology that represents all these 
concepts. This paper presents the generality of the ontology, 
and describes how it is developed, used for analysis and 
validated. 

Fensel et al, [8] provided OIL (Ontology Interchange 
Language): an ontology infrastructure for the Semantic Web. 
Initially, Researchers in artificial intelligence motivate the 
development of ontologies [14] to facilitate knowledge sharing 
and reuse. Ontologies [15] play a key role in supporting 
information exchange across different networks. A 
prerequisite for such a role lead to the development of a joint 
standard for specifying and exchanging ontologies. The 
authors present OIL which satisfies such standards. 

Carlos et al, [9] presented an Ontology-based Metrics 
Computation for Business Process Analysis. Business Process 
Management (BPM) aims to support the whole life-cycle 
required to deploy and maintain business processes in 
organizations. Analyzing business processes have a need of 
computing metrics that can facilitate determining the health of 
business activities and thus the whole enterprise. However, the 
degree of automation currently achieved cannot maintain the 
level of reactivity and adaptation demanded by businesses. In 
this paper the author argue and show how the use of Semantic 
Web technologies can enhance to an important extent the level 
of automation for analyzing business processes. The author 
presents a domain-independent ontological framework for 
Business Process Analysis (BPA) with support for 
automatically computing metrics. In particular, a set of 
ontologies for specifying metrics are defined in this paper. The 
domain-independent metrics computation engine is defined 
that can interpret and compute them. 



Orme et al, [10] described Coupling Metrics for Ontology- 
Based Systems. XML has grown to be frequent in Internet- 
based application domains such as business-to-business and 
business-to-consumer applications. It has moreover produced 
a basis for service-oriented architectures such as Web services 
and the Semantic Web, mainly because ontology data 
employed in the Semantic Web [16, 17] are stored in XML. 
Measuring system coupling is a generally accepted software 
engineering practice connected with producing high-quality 
software products. In many application domains, coupling can 
be assessed in ontology-based systems before system 
development by measuring coupling in ontology data. A 
proposed set of metrics determines coupling of ontology data 
in ontology-based systems [22] represented in the Web 
Ontology Language (OWL), a derivative of XML. 

Andrew et al, [1] define a semiotic metrics suite for 
assessing the quality of ontologies. A suite of metrics 
proposed here is to assess the quality of the ontology. The 
metrics evaluate the syntactic, semantic, pragmatic, and social 
aspects of ontology quality according to the semiotic theory. 
The author operationalizes the metrics and employs them in a 
prototype tool called the Ontology Auditor. A primary 
validation of the Ontology Auditor on the DARPA Agent 
Markup Language (DAML) library of domain ontologies 
represents that the metrics are feasible and highlights the wide 
variation in quality between ontologies in the library. The 
contribution of the research is to afford a theory-based 
framework that developers can utilize to develop high quality 
ontologies and that applications can exploit to choose 
appropriate ontologies for a given task. Zhe et al, [24] 
provides some Evaluation Metrics for Ontology Complexity 
and Evolution Analysis. 

Ying et al, [12] discusses about semantic web. Presently, 
computers are shifting from single, isolated devices into door 
points to a worldwide network of information exchange and 
business transactions called the World Wide Web (WWW). 
For this cause, support in data, information, and knowledge 
exchange has become a key issue in current computer 
technology. The achievement of the WWW has made it 
increasingly hard to find, access, present, and maintain the 
information required by a wide variety of users. In answer to 
this problem, many new research initiatives and commercial 
enterprises have been provided to enhance available 
information with machine processable semantics. This 
semantic web will offer intelligent access to heterogeneous, 
distributed information, enabling software products (agents) 
[20] to intervene between user needs and the information 
sources available. This paper reviews ongoing research in the 
area of the semantic web [19], focusing especially on ontology 
technology. 

Anthony et al, [13, 18] put forward the Complexity and 
coupling metrics for ontology based information. Ontologies 
are greatly used in bioinformatics and genomics to 
characterize the structure of living things. This research 
focuses on complexity metrics for ontologies. These 
complexity metrics are obtained from semantic relationships 
in an ontology. These metrics will assist for selecting the best 
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III. 



Conclusion 



In this survey, the different suite of metrics for evaluating 
ontologies based upon the semiotic-web is analyzed. It is 
better to assess the quality of web-ontology after the design is 
completed. This helps in neglecting the low quality ontology 
before its development by enhancing those defects in 
ontology. This helps in saving of considerable amount of cost 
and effort [2] for developing high quality semantic web 
systems. Several metrics such as Knowledge Enriched metric, 
Characteristics Relevancy metric, Domain Modularity metric, 
Richness, Instance Metrics, Semantic Similarity Measure, 
Density Measure, etc., are analyzed for assessing the quality 
of ontology. This survey helps in choosing the best suited 
metrics for assessing the quality of ontology. The different 
metrics which involve different criteria for ontology are 
analyzed in this survey. 
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Abstract — IDS which are increasingly a key part of system 
defense are used to identify abnormal activities in a computer 
system. In general, the traditional intrusion detection relies on 
the extensive knowledge of security experts, in particular, on 
their familiarity with the computer system to be protected. To 
reduce this dependence, various data-mining and machine 
learning techniques have been used in the literature. In the 
proposed system, we have designed fuzzy logic-based system for 
effectively identifying the intrusion activities within a network. 
The proposed fuzzy logic-based system can be able to detect an 
intrusion behavior of the networks since the rule base contains a 
better set of rules. Here, we have used automated strategy for 
generation of fuzzy rules, which are obtained from the definite 
rules using frequent items. The experiments and evaluations of 
the proposed intrusion detection system are performed with the 
KDD Cup 99 intrusion detection dataset. The experimental 
results clearly show that the proposed system achieved higher 
precision in identifying whether the records are normal or attack 
one. 

Keywords-Intrusion Detection System (IDS); Anomaly based 
intrusion detection; Fuzzy logic; Rule learning; KDD Cup 99 
dataset. 

I. Introduction 

Intrusion incidents to computer systems are increasing 
because of the commercialization of the Internet and local 
networks [1]. Computer systems are turning out to be more 
and more susceptible to attack, due to its extended network 
connectivity. The usual objective of the aforesaid attacks is to 
undermine the conventional security processes on the systems 
and perform actions in excess of the attacker's permissions. 
These actions could encompass reading secure or confidential 
data or just doing vicious destruction to the system or user 
files [2]. A system security operator can detect possibly 
malicious behaviors as they take place by setting up intricate 
tools, which incessantly monitors and informs activities [22]. 
Intrusion detection systems are turning out to be progressively 
significant in maintaining adequate network protection [1,3, 
4, 5]. An intrusion detection system (IDS) watches networked 
devices and searches for anomalous or malicious behaviors in 
the patterns of activity in the audit stream [6]. Capability of 
discriminating between standard and anomalous user 
behaviors should be present in a good intrusion detection 
system [7]. This would comprise of any event, state, content, 



or behavior that is regarded as abnormal by a pre-defined 
criterion [8]. 

Intrusion detection has emerged as a significant field of 
research, because it is not theoretically possible to set up a 
system with no vulnerabilities [9]. One main confrontation in 
intrusion detection is that we have to find out the concealed 
attacks from a large quantity of routine communication 
activities [10]. Several machine learning (ML) algorithms, for 
instance Neural Network [11], Support Vector Machine [12], 
Genetic Algorithm [13], Fuzzy Logic [14], and Data Mining 
[15] and more have been extensively employed to detect 
intrusion activities both known and unknown from large 
quantity of complex and dynamic datasets. Generating rules is 
vital for IDSs to differentiate standard behaviors from strange 
behavior by examining the dataset which is a list of tasks 
created by the operating system that are registered into a file in 
historical sorted order [16]. Various researches with data 
mining as the chief constituent has been carried to find out 
newly encountered intrusions [17]. The analysis of data to 
determine relationships and discover concealed patterns of 
data which otherwise would go unobserved is known as data 
mining. Many researchers have used data mining to focus into 
the subject of database intrusion detection in databases [18]. 

According to the detection strategy used, data mining- 
based intrusion detection systems can be classified into two 
main categories [23]. They are misuse detection which 
identifies intrusions using patterns of well known intrusions or 
weak spots of the system and anomaly detection, which 
attempts to find out if departure from the recognized standard 
usage patterns can be flagged as attacks [19]. (a) Misuse 
Detection: On the basis of the impressions of known 
intrusions and known system weaknesses misuse detection 
tries to model abnormal activities, (b) Anomaly Detection: 
Both user and system behavior can be predicted using normal 
behavior patterns. Anomaly detectors identify possible attack 
attempts by constructing profiles representing normal usage 
and then comparing it with current behavior data to find out a 
likely mismatch [20]. For specified, well-known intrusion 
excellent detection results are achieved by signature-based 
methods. But, they cannot find out unfamiliar intrusions 
though constructed as a least alteration of previously known 
attacks. Conversely, the capability of discovering intrusion 
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events which are previously unobserved is the main advantage 
of anomaly-based detection techniques [21]. 

In the proposed system, we have designed anomaly based 
intrusion detection using fuzzy logic. The input to the 
proposed system is KDD Cup 1999 dataset, which is divided 
into two subsets such as, training dataset and testing dataset. 
At first, the training dataset is classified into five subsets so 
that, four types of attacks (DoS (Denial of Service), R2L 
(Remote to Local), U2R (User to Root), Probe) and normal 
data are separated. After that, we simply mine the 1 -length 
frequent items from attack data as well as normal data. These 
mined frequent items are used to find the important attributes 
of the input dataset and the identified effective attributes are 
used to generate a set of definite and indefinite rules using 
deviation method. Then, we generate fuzzy rule in accordance 
with the definite rule by fuzzifying it in such a way, we obtain 
a set of fuzzy if-then rules with consequent parts that represent 
whether it is a normal data or an abnormal data. These rules 
are given to the fuzzy rule base to effectively learn the fuzzy 
system. In the testing phase, the test data is matched with 
fuzzy rules to detect whether the test data is an abnormal data 
or a normal data. 

The rest of the paper is organized as follows: Section II 
presents literature review of the proposed system and section 
III describes the detailed analysis of the KDD cup 99 dataset. 
The proposed intrusion detection system using fuzzy logic is 
given in section IV. Experimentation and performance 
analysis of the proposed system is discussed in section V. 
Finally, the conclusion is given in section VI. 

II. Review Of Recent Research 

Several techniques are available in the literature for 
detecting the intrusion behavior. In recent times, intrusion 
detection has received a lot of interest among the researchers 
since it is widely applied for preserving the security within a 
network. Here, we present some of the techniques used for 
intrusion detection. 

S. F. Owens and R. R. Levary [24] have stated that 
intruder detection systems have been commonly constructed 
using expert system technology. But, Intrusion Detection 
System (IDS) researchers have been biased in constructing 
systems that are difficult to handle, lack insightful user 
interfaces and are inconvenient to use in real-life 
circumstances. The proposed adaptive expert system has 
utilized fuzzy sets to find out attacks. The expert system 
comparatively easy to implement when used with computer 
system networks has the capability of adjusting to the nature 
and/or degree of the threat. Experiments with Clips 6.10 have 
been used to prove the adjusting capability of the system. Alok 
Sharma et al. [25] have focused on the use of text processing 
techniques on the system call sequences for intrusion 
detection. Host-based intrusions have been detected by 
introducing a kernel based similarity measure. Processes have 
been classified either as normal or abnormal using the k- 
nearest neighbor (kNN) classifier. They have assessed the 
proposed method on the DARPA-1998 database and compared 
its operation with other existing methods present in the 
literature. 



Shi-Jinn Horng et al. [26] have used a combination of 
hierarchical clustering algorithm, easy feature selection 
method, and SVM technique in their proposed SVM-based 
intrusion detection system. Fewer, abstracted, and higher- 
qualified training instances that are derived from the KDD 
Cup 1999 training set has been given to the SVM by the 
hierarchical clustering algorithm. The simple feature selection 
method employed for the removal of insignificant features 
from the training set has enabled the proposed SVM model to 
achieve more precise classification of the network traffic data. 
The proposed system has been assessed using the renowned 
KDD Cup 1999 dataset. Compared to other intrusion detection 
systems that are based on the same dataset, the proposed 
method has exhibited superior performance in identifying DoS 
and Probe attacks and an overall best performance in accuracy. 

B. Shanmugam and Norbik Bashah Idris [28] have 
proposed an advanced fuzzy and data mining methods based 
hybrid model to find out both misuse and anomaly attacks. 
Their objective was to decrease the quantity of data kept for 
processing and also to improve the detection rate of the 
existing IDS using attribute selection process and data mining 
technique respectively. A modified version of APRIORI 
algorithm which is an improved Kuok fuzzy data mining 
algorithm utilized for implementing fuzzy rules has enabled 
the generation of if-then rules that show common ways of 
expressing security attacks. They have achieved faster 
decision making using mamdani inference mechanism with 
three variable inputs in the fuzzy inference engine which they 
have employed. The DARPA 1999 data set has been used to 
test and benchmark the efficiency of the proposed model. In 
addition, the test results against the "live" networking 
environment within the campus have been analyzed. 

O. A. Adebayo et al. [29] have presented a method that 
uses Fuzzy-Bayesian to detect real-time network anomaly 
attack for discovering malicious activity against computer 
network. They have established the effectiveness of the 
method by describing the framework. The overall performance 
of the intrusion detection system (IDS) based on Bayes has 
been improved by a combination of fuzzy with Bayesian 
classifier. In addition, by the experiment carried out on KDD 
1999 IDS data set, the practicability of the method has been 
verified. Abadeh, M.S. and Habibi, J. [27] have proposed a 
method to develop fuzzy classification rules for intrusion 
detection use in computer networks. The method of fuzzy rule 
base system design has been based on the iterative rule 
learning approach (IRL). Using the evolutionary algorithm to 
optimize one fuzzy classifier rule at a time, the fuzzy rule base 
has been created in an incremental fashion. Intrusion detection 
problem has been used as a high-dimensional classification 
problem to analyze the functioning of the final fuzzy 
classification system. Results have demonstrated that the fuzzy 
rules generated by the proposed algorithm can be utilized to 
build a reliable intrusion detection system. 

Arman Tajbakhsh et al. [30] have presented a data mining 
technique based framework for constructing an IDS. In the 
framework, Association Based Classification (ABC) has been 
used by the classification engine which is in fact the central 
part of the IDS. Fuzzy association rules have been used by the 
proposed classification to construct classifiers. Some matching 
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measures have been used to evaluate the consistency of any 
new sample (which is to be categorized) with various class 
rulesets and the label of the sample has been declared as the 
class that is analogous to the best matched ruleset. A method 
which decreases the items that may be included in extracted 
rules has also been proposed to reduce the time taken by the 
rule induction algorithm. The framework has been assessed 
using KDD-99 dataset. The results have shown that the 
achieved total detection rate and detection rate of known 
attacks are large and false positive rate is small, though the 
results are not bright for unknown attacks. 

Zhenwei Yu et al. [31] have presented an automatically 
tuning intrusion detection system (ATIDS). According to the 
feedback supplied by the system operator, when false 
predictions are detected, the proposed system automatically 
tunes the detection model on-the-fly. The KDDCup'99 
intrusion detection dataset has been used to assess the system. 
In the experimental results, the system has demonstrated a 
35% enhancement with regard to misclassification cost 
compared to a system that is not using the tuning feature. If 
the model has been tuned using only 10% false predictions 
still a 30% improvement is achieved by the system. Moreover, 
the model tuned using only 1.3% of the false predictions have 
been capable of achieving about 20% improvement provided 
the tuning is not delayed too long. Building a practical system 
based on ATIDS has been proved to be feasible by the results 
of the experiments: Because predictions ascertained to be false 
have been used for tuning the detection model, system 
operators can concentrate on confirmation of predictions with 
low confidence. 

III. Kdd Cup 99 Dataset 

In 1998, DARPA in concert with Lincoln Laboratory at 
MIT launched the DARPA 1998 dataset for evaluating IDS 
[36]. The DARPA 1998 dataset contains seven weeks of 
training and also two weeks of testing data. In total, there are 
38 attacks in training data as well as in testing data. The 
refined version of DARPA dataset which contains only 
network data (i.e. Tcpdump data) is termed as KDD dataset 
[37]. The Third International Knowledge Discovery and Data 
Mining Tools Competition were held in colligation with KDD- 
99, the Fifth International Conference on Knowledge 
Discovery and Data Mining. KDD dataset is a dataset 
employed for this Third International Knowledge Discovery 
and Data Mining Tools Competition. KDD training dataset 
consists of relatively 4,900,000 single connection vectors 
where each single connection vectors consists of 41 features 
and is marked as either normal or an attack, with exactly one 
particular attack type [38]. These features had all forms of 



continuous and symbolic with extensively varying ranges 
falling in four categories: 

• In a connection, the first category consists of the intrinsic 
features which comprises of the fundamental features of each 
individual TCP connections. Some of the features for each 
individual TCP connections are duration of the connection, the 
type of the protocol (TCP, UDP, etc.) and network service 
(http, telnet, etc.). 

• The content features suggested by domain knowledge are 
used to assess the payload of the original TCP packets, such as 
the number of failed login attempts. 

• Within a connection, the same host features observe the 
recognized connections that have the same destination host as 
present connection in past two seconds and the statistics 
related to the protocol behavior, service, etc are estimated. 

• The similar same service features scrutinize the 
connections that have the same service as the current 
connection in past two seconds. 

A variety of attacks incorporated in the dataset fall into 
following four major categories: Denial of Service Attacks: 
A denial of service attack is an attack where the attacker 
constructs some computing or memory resource fully 
occupied or unavailable to manage legitimate requirements, or 
reject legitimate users right to use a machine. User to Root 
Attacks: User to Root exploits are a category of exploits 
where the attacker initiate by accessing a normal user account 
on the system (possibly achieved by tracking down the 
passwords, a dictionary attack, or social engineering) and take 
advantage of some susceptibility to achieve root access to the 
system. Remote to User Attacks: A Remote to User attack 
takes place when an attacker who has the capability to send 
packets to a machine over a network but does not have an 
account on that machine, makes use of some vulnerability to 
achieve local access as a user of that machine. Probes: 
Probing is a category of attacks where an attacker examines a 
network to collect information or discover well-known 
vulnerabilities. These network investigations are reasonably 
valuable for an attacker who is staging an attack in future. An 
attacker who has a record, of which machines and services are 
accessible on a given network, can make use of this 
information to look for fragile points. 

Table I illustrates a number of attacks falling into four 
major categories and table II presents a complete listing of a 
set of features characterized for the connection records. 



Table I. Various types of attacks described in four major categories 



Denial of 
Service Attacks 


Back, land, neptune, pod, smurf, teardrop 


User to Root 
Attacks 


Buffer_overflow, loadmodule, perl, rootkit, 


Remote to 
Local Attacks 


Ftp_write, guess_passwd, imap, multihop, 
phf, spy, warezclient, warezmaster 


Probes 


Satan, ip sweep, nmap, ports weep 
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Table ii. A complete list of features given in KDD cup 99 dataset 



Feature 
index 


Feature name 


Description 


Type 


1 


duration 


length (number of seconds) of the connection 


continuous 


2 


protocol_type 


type of the protocol, e.g. tcp, udp, etc. 


symbolic 


3 


service 


network service on the destination, e.g., http, telnet, etc. 


symbolic 


4 


flag 


normal or error status of the connection 


symbolic 


5 


src_bytes 


number of data bytes from source to destination 


continuous 


6 


dst_bytes 


number of data bytes from destination to source 


continuous 


7 


Land 


1 if connection is from/to the same host/port; 
otherwise 


symbolic 


8 


wrong_fragment 


number of vv wrong" fragments 


continuous 


9 


urgent 


number of urgent packets 


Continuous 


10 


hot 


number of vv hot" indicators 


Continuous 


11 


num_failed_logins 


number of failed login attempts 


Continuous 


12 


logged_in 


1 if successfully logged in; otherwise 


Symbolic 


13 


num_compromised 


number of vv compromised" conditions 


Continuous 


14 


root_shell 


1 if root shell is obtained; otherwise 


Continuous 


15 


su_attempted 


1 if vv su root" command attempted; otherwise 


Continuous 


16 j 


num_root 


number of vv root" accesses 


Continuous 


17 


num_file_creations 


number of file creation operations 


Continuous 


18 


num_shells 


number of shell prompts 


Continuous 


19 


num_access_files 


number of operations on access control files 


Continuous 


20 


num_outbound_cmds 


number of outbound commands in an ftp session 


Continuous 


21 


is_hot_login 


1 if the login belongs to the vv hot" list; otherwise 


Symbolic 


22 


is_guest_login 


1 if the login is a vv guest" login; otherwise 


Symbolic 


23 


count 


number of connections to the same host as the current 
connection in the past two seconds 


continuous 


24 


srv_count 


number of connections to the same service as the current 
connection in the past two seconds 


Continuous 


25 


serror_rate 


% of connections that have VV SYN" errors 


continuous 


26 


srv_serror_rate 


% of connections that have VV SYN" errors 


Continuous 


27 


rerror_rate 


% of connections that have VV REJ" errors 


Continuous 


28 


srv_rerror_rate 


% of connections that have VV REJ" errors 


Continuous 


29 


same_srv_rate 


% of connections to the same service 


Continuous 


30 


diff_srv_rate 


% of connections to different services 


Continuous 


31 


sr v_dif f _ho st_r ate 


% of connections to different hosts 


Continuous 


32 


dst_host_count 


count for destination host 


continuous 


33 


dst_host_srv_count 


srv_count for destination host 


continuous 


34 


dst_host_same_srv_rat 
e 


same_srv_rate for destination host 


continuous 


35 


dst_host_diff_srv_rate 


diff_srv_rate for destination host 


continuous 


36 


dst_host_same_src_po 
rt_rate 


same_src_port_rate for destination host 


continuous 


37 


dst_ho st_sr v_dif f _ho st 
_rate 


diff_host_rate for destination host 


continuous 


38 


dst_host_serror_rate 


serror_rate for destination host 


continuous 


39 


dst_host_srv_serror_ra 
te 


srv_serror_rate for destination host 


continuous 


40 


dst_ho st_rerror_rate 


rerror_rate for destination host 


continuous 


41 


dst_host_srv_rerror_ra 
te 


srv_serror_rate for destination host 


continuous 
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IV. An Anomaly-Based Network Intrusion Detection 
System Using Fuzzy Logic 

Presently, it is unfeasible for several computer systems to 
affirm security to network intrusions with computers 
increasingly getting connected to public accessible networks 
(e.g., the Internet). In view of the fact that there is no ideal 
solution to avoid intrusions from event, it is very significant to 
detect them at the initial moment of happening and take 
necessary actions for reducing the likely damage [32]. One 
approach to handle suspicious behaviors inside a network is an 
intrusion detection system (IDS). For intrusion detection, a 
wide variety of techniques have been applied specifically, data 
mining techniques, artificial intelligence technique and soft 
computing techniques. Most of the data mining techniques like 
association rule mining, clustering and classification have 
been applied on intrusion detection, where classification and 
pattern mining is an important technique. Similar way, AI 
techniques such as decision trees, neural networks and fuzzy 
logic are applied for detecting suspicious activities in a 
network, in which fuzzy based system provides significant 
advantages over other AI techniques. 

Recently, several researchers focused on fuzzy rule 
learning for effective intrusion detection using data mining 
techniques. By taking into consideration these motivational 
thoughts, we have developed a fuzzy rule based system in 
detecting the attacks. This system, anomaly-based intrusion 
detection makes use of effective rules identified in accordance 
with the designed strategy, which is obtained by mining the 
data effectively. The fuzzy rules generated from the proposed 
strategy can be able to provide better classification rate in 
detecting the intrusion behavior. Even though signature-based 
systems provide good detection results for specified and 
familiar attacks, the foremost advantage of anomaly-based 
detection techniques is their ability to detect formerly unseen 
and unfamiliar intrusion occurrences. On the other hand and in 
spite of the expected erroneousness in recognized signature 
specifications, the rate of false positives in anomaly-based 
systems is generally higher than in signature based ones [21]. 
The different steps involved in the proposed system for 
anomaly-based intrusion detection (shown in figure 1) are 
described as follows: 

(1) Classification of training data 

(2) Strategy for generation of fuzzy rules 

(3) Fuzzy decision module 

(4) Finding an appropriate classification for a test input 



KDD cup-99 Data set 



Training Data (41 
features) 



Testing Data (41 
features) 



Modified Training 
Data (34 features) 



Modified Test Data 
(34 features) 



Classifying the 
Training Data 



Mining of single 
length items 



Identification of 
suitable attributes 



Generation of 
fuzzy Rules 



Output 



Fuzzifier 



Defuzzifier 






a 



Inference Engine 



Membership function 



Rule Base 

A 



Figure 1 . The overall steps of the proposed intrusion detection system 

(1) Classification of Training Data 

The first component of the proposed system is of 
classifying the input data into multiple classes by taking in 
mind the different attacks involved in the intrusion detection 
dataset. The dataset we have taken for analyzing the intrusion 
detection behavior using the proposed system is KDD-Cup 
1999 data. The detailed analysis of KDD-Cup 1999 data is 
given in section III. Based on the analysis, the KDD-Cup 1999 
data contains four types of attacks and normal behavior data 
with 41 attributes that have both continuous and symbolic 
attributes. The proposed system is designed only for the 
continuous attributes because the major attributes in KDD- 
Cup 1999 data are continuous in nature. Therefore, we have 
taken only the continuous attributes for instance, 34 attributes 
from the input dataset by removing discrete attributes. Then, 
the dataset ( D ) is divided into five subsets of classes based on 
the class label prescribed in the dataset D = {Dj ; 1 < i < 5} . 

The class label describes several attacks, which comes under 
four major attacks (Denial of Service, Remote to Local, U2R 
and Probe) along with normal data. The five subsets of data 
are then used for generating a better set of fuzzy rules 
automatically so that the fuzzy system can learn the rules 
effectively. 

(2) Strategy For Generation of Fuzzy Rules 

This section describes the designed strategy for automatic 
generation of fuzzy rules to provide effective learning. In 
general, the fuzzy rules given to the fuzzy system is done 
manually or by experts, who are given the rules by analyzing 
intrusion behavior. But, in our case, it is very difficult to 
generate fuzzy rules manually due to the fact that the input 
data is huge and also having more attributes. But, a few of 
researches are available in the literature for automatically 
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identifying of fuzzy rules in recent times. Motivated by this 
fact, we make use of mining methods to identify a better set of 
rules. Here, definite rules obtained from the single length 
frequent items are used to provide the proper learning of fuzzy 
system. The process of fuzzy generation is given in the 
following sub-section. 

(a) Mining of single length frequent items 

At first, frequent items (attributes) are discovered from 
both classes of input data and by using these frequent items, 
the significant attributes are identified for the input KDD-cup 
99 dataset. In general, frequent itemset are mined using 
various conventional mining algorithms, such as Apriori [35] 
and FP-Growth [40]. These algorithms are suitable to mine 
frequent itemset with varying length only for the binary 
database, which contains only the binary values. But, the input 
dataset (KDD cup-99) contains continuous variable for each 
attributes so that, the conventional algorithm is not suitable for 
mining frequent items. By considering this property, we 
simply find the 1 -length items from each attributes by finding 
the frequency of the continuous variable present in each 
attribute and then, the frequent items are discovered by 
inputting the minimum support. These frequent items are 
identified for both class namely, normal and attack (combining 
four types of attacks). 

(b) Identification of suitable attributes for rule generation 

In this step, we have chosen only the most suitable 
attributes for identifying the classification whether the record 
is normal or attack. The reason behind this step is that the 
input data contain 34 attribute, in which all the attributes are 
not so effective in detecting the intrusion detection. For 
identifying the suitable attribute, we have used deviation 
method, where mined 1 -length frequent items are used. At 
first, the mined 1-length items from each attribute are stored in 
a vector so that 34 vectors are obtained for each class (class 1 
and class 2), represented as, C/ = [Vi ,V2>"* , V y , • - • , V34 J 

where, i = 1 (refer to normal) , 2 (refer to attack) . Each vector 
(Vj ) contains frequent items, whose frequency is greater than 
minimum support. Vj = {// ; 1 < i < m\ ; 

support(fi ) > min_supp . Then, for each attribute, deviation 

range of frequent items is identified by comparing the frequent 
items present within a vector such a way, the deviation range 
{max, min} is obtained for every vector. 

D v(j) = i/max > /min I J where > /max = Max{fj ) ; / min = Min{fj ) 

Then, one-to-one comparison is performed in between both 
class of respective vector to identify the effective attribute. 
The attributes that not contain identical {max, min} range for 
both class is chosen as effective attribute, which will give 
significant detection rate rather than utilizing the all attribute 
for identifying the classification. The effective attributes 
chosen for rule generation process is represented 

9 C t =|V (1) ,y (2) ,---,y (j ' ) ,---,y (/:) J, Where, k<34. 



as, 



(c) Rule generation 

The effective attributes chosen from the previous step is 
utilized to generate rules that is derived from the {max, min} 
deviation. By comparing the deviation range of effective 



attributes in between the normal and attack data, the 
intersection points are identified for the effective attributes. By 
making use of these two intersection points, the definite and 
indefinite rules are generated. For example, {max, min} 
deviation range for normal data related to attribute 1 is {1, 5} 
and {max, min} deviation for attack data corresponding to 
attribute 1 is {2, 8}. Then, the rule is designed like, "IF 
attributel is greater than 5, THEN the data is attack, "IF 
attributel is in between 2 and 5, THEN the data is normal OR 
attack" and "IF attributel is less than 2, THEN the data is 
normal" . In addition to that, some of the data contains only 
one intersection point, which provides only two rules. 

(d) Rule filtering 

In order to learn the fuzzy rules efficiently and design a 
compact and interpretable classification system, we should 
concentrate in these two criteria given in [33, 34]: (1) The 
number of fuzzy rules should be decreased as much as 
possible, (2) The IF part of fuzzy rules should be short. By 
concentrating on these two criteria, we have filtered the rules 
such a way that, we take only the short and less number of 
rules. The rules that are generated from the previous step 
contain definite and indefinite rules. The definite rules are the 
rules that contain only one classified label in the THEN part 
and indefinite rule contain two classification label data in the 
THEN part. The proposed rule filtering technique filters the 
indefinite rule and selects only the definite rules for learning 
the fuzzy system. 

(e) Generating fuzzy rules 

In general, fuzzy rules are defined within the fuzzy system 
manually or the rules are obtained from the domain expert. 
But, in the proposed system, we automatically find the fuzzy 
rules based on the mined 1-length frequent items. The fuzzy 
rules are generated from the definite rules, where the IF part of 
the rule is a numerical variable and THEN part is a class label 
related to attack name or normal. But, the fuzzy rule should 
contain only the linguistic variable. So, in order to make the 
fuzzy rules from the definite rules, we should fuzzify the 
numerical variable of the definite rules and THEN part of the 
fuzzy rule is same as the consequent part of the definite rules. 
For example, "IF attributel is H, THEN the data is attack and 
"IF attributel is VL, THEN the data is normal". These fuzzy 
rules are used to learn the fuzzy system so that the 
effectiveness of the proposed system will be improved rather 
than simply using the fuzzy rules without any proper 
techniques. 

(3) Fuzzy Decision Module 

This section describes the designing of fuzzy logic system 
for finding the suitable class label of the test dataset. Zadeh in 
the late 1960s [39] introduced Fuzzy logic and is known as the 
rediscovery of multivalued logic designed by Lukasiewicz. 
The designed fuzzy system shown in figure 2 contains 34 
inputs and one output, where inputs are related to the 34 
attributes and output is related to the class label (attack data or 
normal data). Here, thirty four-input, single-output of 
Mamdani fuzzy inference system with centroid of area 
defuzzification strategy was used for this purpose. Here, each 
input fuzzy set defined in the fuzzy system includes four 
membership functions (VL, L, M and H) and an output fuzzy 
set contains two membership functions (L and H). Each 
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membership function used triangular function for fuzzification 
strategy. The fuzzy rules obtained from sub-section IV.B are 
fed to the fuzzy rule base for learning the system. 

Input 1 



Table IV. Training dataset taken for evaluation 



Input 2 



Input 34 








Output 


> 




Mamdani fuzzy 


^<^ 


inference system 





Figure 2. The designed Fuzzy system 

(4) Finding an Appropriate Classification for a Test Input 

For testing phase, a test data from the KDD-cup 99 dataset 
is given to the designed fuzzy logic system discussed in sub- 
section IV.C for finding the fuzzy score. At first, the test input 
data containing 34 attributes is applied to fuzzifier, which 
converts 34 attributes (numerical variable) into linguistic 
variable using the triangular membership function. The output 
of the fuzzifier is fed to the inference engine which in turn 
compares that particular input with the rule base. Rule base is 
a knowledge base which contains a set of rules obtained from 
the definite rules. The output of inference engine is one of the 
linguistic values from the following set {Low and High} and 
then, it is converted by the defuzzifier as crisp values. The 
crisp value obtained from the fuzzy inference engine is varied 
in between to 2, where '0' denotes that the data is 
completely normal and '1' specifies the completely attacked 
data. 

V. Experimentation 

This section describes the experimental results and 
performance evaluation of the proposed system. The proposed 
system is implemented in MATLAB (7.8) and the 
performance of the system is evaluated using Precision, recall 
and F-measure. For experimental evaluation, we have taken 
KDD cup 99 dataset [37], which is mostly used for evaluating 
the performance of the intrusion detection system. For 
evaluating the performance, it is very difficult to execute the 
proposed system on the KDD cup 99 dataset since it is a large 
scale. Here, we have used subset of 10% of KDD Cup 99 
dataset for training and testing. The number of records taken 
for testing and training phase is given in table III and table IV. 

Table III. Training dataset taken for experimentation 



Training Dataset 


Normal 


25,000 


DOS 


25,000 


Probe 


4107 


RLA 


77 


URA 


42 



Testing Dataset 


Normal 


26,000 


DOS 


26,000 


Probe 


4107 


RLA 


77 


URA 


42 



A. Experimental Results and Performance Analysis 

The training dataset contains normal data as well as four 
types of attacks, which are given to the proposed system for 
identifying the suitable attributes. The selected attribute for 
rule generation process is given in table V. Then, using the 
fuzzy rule learning strategy, the system generates definite and 
indefinite rules and finally, fuzzy rules are generated from the 
definite rules. 

Table V. Selected attributes for rule generation 



Attribute Index 


Selected Attributes 


1 


duration 


5 


src_bytes 


6 


dst_bytes 


8 


wrong_fragment 


9 


urgent 


10 


hot 


11 


num_failed_logins 


13 


num_compromised 


16 


num_root 


17 


num_file_creations 


18 


num_shells 


19 


num_access_files 


23 


count 


24 


srv_count 



In the testing phase, the testing dataset is given to the 
proposed system, which classifies the input as a normal or 
attack. The obtained result is then used to compute overall 
accuracy of the proposed system. The overall accuracy of the 
proposed system is computed based on the definitions, namely 
precision, recall and F-measure which are normally used to 
estimate the rare class prediction. It is advantageous to 
accomplish a high recall devoid of loss of precision. F- 
measure is a weighted harmonic mean which evaluates the 
trade-off between them. 



Precision ■ 



Recall = - 



TP 



TP + FP 
TP 



TP + FN 



F - measure ■ 



Overall accuracy = 



(/? + \){Precision • Recall) 
9 
P • Precision + Recall 

TP + TN 



where, /3 = \ 



TP + TN + FN + FP 



Where, TP -> True positive 
TN ->True negative 
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FN -> False negative 
FP -> False positive 
These are computed using the confusion matrix in Table 
VI, and defined as follows: 



Table VI. Confusion matrix 



Actual 
Class 



Positive 
class 



Negative 
class 



Predicted class 


Positive 


Negative 


class 


class 


True 


False 


positive 


negative 


(TP) 


(FN) 


False 


True 


positive 


negative 


(FP) 


(TN) 



The evaluation metrics are computed for both training and 
testing dataset in the testing phase and the obtained result for 
all attacks and normal data are given in table VII, which is the 
overall classification performance of the proposed system on 
KDD cup 99 dataset. By analyzing the result, the overall 
performance of the proposed system is improved significantly 
and it achieves more than 90% accuracy for all types of 
attacks. 

Table VII. The classification performance of the proposed intrusion 

DETECTION SYSTEM 



Metric 



Proposed System 
Training Testing 



PROBE 



DOS 



U2R 



R2L 



NORMAL 



Precision 
Recall 
F-measure 
Accuracy 

Precision 
Recall 
F-measure 
Accuracy 

Precision 
Recall 
F-measure 
Accuracy 

Precision 
Recall 
F-measure 
Accuracy 

Precision 
Recall 
F-measure 
Accuracy 



0.912522 
0.37083 
0.52735457 
0.906208 



0.912522 
0.37083 
0.52735457 
0.909323 



0.993563 0.993828 

0.90144 0.904154 

0.94526236 0.94687236 

0.9478 0.949269 

0.051948 0.051948 

0.190476 0.190476 

0.08163265 0.08163265 

0.992812 0.993088 

0.075949 0.075949 

0.155844 0.155844 

0.10212766 0.10212766 

0.991586 0.991909 



0.828439 
0.99416 
0.90376539 
0.910852 



0.829318 
0.994385 
0.90438129 
0.903019 



VI. Conclusion 

We have developed an anomaly based intrusion detection 
system in detecting the intrusion behavior within a network. A 
fuzzy decision-making module was designed to build the 
system more accurate for attack detection, using the fuzzy 
inference approach. An effective set of fuzzy rules for 
inference approach were identified automatically by making 
use of the fuzzy rule learning strategy, which are more 
effective for detecting intrusion in a computer network. At 
first, the definite rules were generated by mining the single 
length frequent items from attack data as well as normal data. 
Then, fuzzy rules were identified by fuzzifying the definite 
rules and these rules were given to fuzzy system, which 
classify the test data. We have used KDD cup 99 dataset for 
evaluating the performance of the proposed system and 
experimentation results showed that the proposed method is 
effective in detecting various intrusions in computer networks. 
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Abstract — The computational paradigm known as quantum-dot 
cellular automata (QCA) encodes binary information in the 
charge configuration of Coulomb-coupled quantum-dot cells. 
Functioning QCA devices made of metal-dot cells have been 
fabricated and measured. We focus here on the issue of 
robustness in the presence of disorder and thermal fluctuations. 
We examine the performance of a semi-infinite QCA shift 
register as a function of both clock period and temperature. The 
existence of power gain in QCA cells acts to restore signal levels 
even in situations where high speed operation and high 
temperature operation threaten signal stability. Random 
variations in capacitance values can also be tolerated. 

Keywords-component; QCA, molecular electronics, single 
electronics, quantum-dot cellular automata, nanoelectronics 

I. Introduction 

Conventional transistor-based CMOS technology faces 

great challenges with the down-scaling of device sizes in 

recent years. Issues such as quantum effects, dopant-induced 
disorder, and power dissipation may hinder further progress in 
scaling microelectronics. As the scaling approaches a 
molecular level, a new paradigm beyond using current switches 
to encode binary information may be needed. Quantum-dot 
cellular automata (QCA) [1-3, 6, 12, 13, 18, 21] emerges as 
one such a paradigm. In the QCA approach bit information is 

encoded in the charge configuration within a cell. Columbic 
interaction between cells is sufficient to accomplish the 
computation; no current flows out of the cell. It has been shown 
that very low power dissipation is possible [8]. 

A clocked QCA cell constructed with six quantum dots is 
shown in Fig. 1. Dots are simply places where a charge is 
localized. Two mobile electrons are present in the cell. The 
electrons will occupy antipodal sites in the corner dots because 
of Coulomb repulsion. The two configuration states correspond 
to binary information of " 1" and "0" The electrons can also be 
pulled to middle dots if the occupancy energy in the middle 
dots is lower than corner dots. In this case we term the 
configuration "null" with no binary information present. The 
clock adjusts the relative occupancy energy between active dots 



in the corner and null dots in the middle, pushing electrons to 
either active dots or null dots. The cell therefore switches 
between null state and active state. When a cell is placed close 
to another cell (as shown in Fig. lb), they will have the same 
polarization due to Coulomb coupling. Based on the cell-to-cell 
interaction, logical QCA devices like binary wires, inverters, 
majority gates and full adders can all be implemented [18]. 

QCA devices exist. QCA devices made of metal-dot cells 
have been successfully demonstrated at low temperatures. 
Majority gates, binary wires, memories, clocked shift registers 
and fan outs have all been fabricated [1-3, 12, 13, 21]. Figure 2 
shows a schematic diagram and scanning electron micrograph 
of a clocked shift register. Aluminum islands form the dots and 
Al/AlOx tunnel junctions serve as the tunneling path between 
dots. 

Tunnel junctions are fabricated with shadow evaporation 
technique. Multiple tunnel junctions are used instead of a single 
junction to suppress co-tunneling. The clock is implemented by 
simply applying voltage to leads capacitively coupled to the 
middle dots. Single electron transistors (SET_s) are used as 
readout electrometers. Though the operation of metal-dot QCA 
devices is restricted to cryogenic temperatures, they may be 
viewed as prototypes for molecular QCA cells that will operate 
at room temperature. It may well be that molecular QCA,with 
the possibility of enormous functional densities, very low 
power dissipation, and room temperature operation, is finally 
the most promising system [5, 9-11, 14, 16]. 

Metal-dot QCA do have the advantage of having been 
already created and tested, and we expect that understanding 
the details of robustness in the metal-dot system will yield 
benefits for designing molecular systems. Here we focus on 
the robustness in metal-dot QCA circuits. In particular, we 
consider theoretically the effects of temperature, random 
variations in capacitance, and operating speed, on the 
performance of a semiinfinite QCA shift register. The paper is 
organized as follows: in Section II, we describe the application 
of single-electron tunneling theory to metal QCA devices. 

Section III describes the characterization of power gain in 
QCA circuits. In Section IV we analyze the operation of a 
semi-infinite QCA shift register. Finally, in Section V we 
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and non-leaky capacitors. 



Leads by contrast are metal electrodes whose voltages are 
fixed by external sources. We define dot charge qi as the 
charge on island i and qk as the charge on lead k. The free 
energy of charge configuration within the circuit is the 
electrostatic energy stored in the capacitors and tunnel 
junctions minus the work done by the voltage sources [20]: 



Fig. I. Schematic of a QCA cell, a T- 
hc three states of a single cell, b Coulomb 
interactions couple the states of neighbor- 
ing cells. ca 

lculate behavior of the QCA shift register in the limits 
of high speed, high temperature, and high defect levels. 

II. Single Electron System Theory 

Metal-dot QCA can be described with the orthodox theory 
of coulomb blockade [19]. The circuit is defined by charge 
configurations, which are determined by the number of 
electrons on each of the metal islands. Metal islands are regions 
of metal surrounded by insulators; at zero temperature they 
hold an integer number of charges. The islands play the role of 
QCA dots and are coupled to other islands and leads through 
tunnel junctions (i.e., quantummechanically leaky capacitors) 



F = - 






T ' 



0) 



Here C is the capacitance matrix including all the junctions 
and capacitors, v is the column vector of lead voltages, and q 
and qO are the column vectors of dot charges and lead charges. 
At zero temperature, the equilibrium charge configuration is 
the one that has minimum free energy and the number of 
charges on each islands is exactly an integer. A tunneling event 
happens at zero temperature only if the free energy is lower for 
the final state than for the initial state. At finite temperatures, a 
dot charge need no longer be an integer but is rather a thermal 
average over all possible configurations. A thermally excited 
tunneling event may happen even when the 
free energy increases. The transition rate of tunneling between 
two charge configuration states at a certain temperature T is 
given by 



r„- 
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where RT is the tunneling resistance, Af Vis the energy 
difference between the initial state i and final state j. 

The tunneling events can be described by a master 
equation — a conservation law for the temporal change of the 
probability distribution function of a physical quantity, 
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Fig. 2. a Schematic of a clocked itfii ft register, b Scanning electron micrograph of a clocked shift register. 
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where P is the vector of state probabilities and * is the 
transition matrix. From the solution P(t) we can obtain the 
ensemble average of the charge on each dot. We solve Eq. 3 
directly and find the dot charge as a function of time; from this 
we can obtain any other voltage or charge in the circuit. In 
many systems direct solution of the master equation, which 
requires the enumeration of all the accessible states of the 
system is impractical due to the large set of accessible states. 
Because QCA operates so near the instantaneous ground state 
of the system, complete enumeration of the accessible states is 
possible and we need not resort to Monte Carlo methods. 

III. Power Gain in QCA 

A robust circuit must have power gain in order to restore 
signals weakened due to unavoidable dissipative processes. In 
conventional CMOS, the power supply provides the energy 
power gain. In QCA systems the energy needed for power gain 
is supplied by the clock. A weak input is augmented by work 
done by the clock to restore logic levels. Power gain has been 
studied theoretically in molecular QCA circuits [8] and 
measured experimentally in metal-dot QCA circuits [3]. Power 
gain is defined by the ratio of the work done by the cell on its 
neighbor to the right (the output of the cell), to the work done 
on the cell by its neighbor to the left (the input to the cell). The 
work done on a cell by an input lead coupled through an input 
capacitor C over a time interval T is given by 
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values of input voltage and clock voltage, the lowest free 
energy. 

The clocking cycle can be envisioned as follows. First, a 
small input bias is applied, when the clock is high (less 



W 



-I 



V(t)^QA0dt 



(4) 



where V(t) is the lead voltage, Qc(t) is the charge on the input 
capacitor. We consider the total work done over a clock period 
so the cell configuration is the same at t=0 and t=T. The power 
gain is thus the ratio of output to input signal power W out /W in , 
where each sums the work done by (on) all input (output) leads. 



IV. Operation of Semi-infinite QCA Shift Register 

The schematic of a clocked half QCA cell is shown in 
Fig.3a. The capacitances are taken to be Cj=1.6 aF, Cg=0.32 
aF, Cc=0.8 aF, and the tunneling resistance RT=100 kW. 
Eachisland is grounded through a capacitance of 0.32 aF. These 
are physically reasonable though somewhat better (meaning 
capacitances are smaller) than the experiments have so-far 
achieved. Input is applied to the top and bottom dot through 
coupling capacitors. The potential difference between the top 
and bottom dots is the output Vcell. 

The phase diagram of the equilibrium charge state 
configuration of the cell shown in Fig. 3a is plotted in Fig. 4. 
The diagram shows the calculated stable regions of charge 
configuration as a function of input and clock potential. Each 
hexagonal region is labeled by three integers (nl, n2, n3), the 
number of elementary charges in the top, middle, and bottom 
dot, respectively. A positive number indicates an extra hole 
and negative number represents an extra electron. Each 
hexagon represents the configuration state that has, for those 
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Mjut-V«|i 



4 




Fig. 3. a Schematic of a clocked triple dot. The input is applied to the 
top and bottom dot. The clock is set to the middle dot. The output 
defined as Vcell is the differential potential etween the top and the 
bottom dot. Cj=1.6 aF, Cg=0.32 aF, Cc=0.8 aF. The capacitor to 
ground is 0.32 aF. RT=100 kW. b Schematic of a shift register 
composed of a line of identical triple dots in a. The thick line 
described the actual four cells simulated. 

negative, in fact for this circuit 0). This situation corresponds 
to point a in Fig. 4; no electron switching event happens and 
the cell remains in the null state, holding no information. When 
the clock is then lowered (more negative) the system moves 
along the line shown through point b. An electron is switched 
to either top dot or bottom dot, decided by the input; the cell is 
then in the active state. If the clock is held very negative (point 
c), the electron is locked in the active state, since the energy 
barrier in the middle dot is too high to overcome. The locked 
cell is essentially a single bit memory — its present state 
depends on its state in the recent past, not on the state of 
neighbors. Varying clock potential gradually between point a 
and c will switch the cell between null, active and locked state 
adiabatically. 

A QCA shift register can be constructed with a line of 
capacitively coupled half QCA cells shown in Fig. 3b, where 
the output from each cell acts as the input to its right neighbor. 
The transport of information from cell to cell is controlled by 
clock signals. Initially, all the cells are in the null state since the 
clocks are high. Then an input signal is applied to the first cell 
and the clock for the first cell is lowered. The first cell thus 
switches to the opposite state of the input and holds to that state 
even when input is removed. 

When the clock for the second cell is lowered, the second 
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cell switches to the opposite state to the first cell accordingly 
and locks the bit. The information is thereafter propagated 
along the cell line by the clock signals. Each cell in turn 
copies (an inverts) a bit from its neighbor to the left when 
the left neighbor is in the locked state and erases the bit, i.e., 
returns to the null state, while its right neighbor still holds a 
copy (inverted) of the bit. The copying of the bit can be 
accomplished gradually so that the switching cell is always 
close to its instantaneously ground state and thus dissipates 
very little energy. 
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Vol. 8, No. 8,2010 
of the bit, the cell potential in the middle cell decreases slightly 
(in absolute value) while the cell potential in its left and right 
neighbor increase slightly (thus the small Bnotch A in the center 
of the flat parts of the waveform).. 




50 100 150 2QQ 



Vin (mv) 

Fig. 4. The equilibrium state configuration of a triple dot cell 
described in Fig. 6. (nl, n2, n3) are the number of charges in the top, 
middle and bottom dot, respectively. The cell is in the null state in 
point a. The cell is in the active state in point b. The cell is in locked 
state in point c. 

It_s instructive to model a semi-infinite shift register in 
order to study the robustness in the QCA circuit. A four phase 
clocking scheme is adopted to achieve adiabatic switching, 
shown in Fig. 5. Each clock signal is shifted a quarter-period. 
As a bit moves down the shift register, we need model only a 
four QCA half-cells at a time, since by the time the bit is 
latched in the leading cell, the leftmost cell has returned to null. 
This is equivalent to viewing the simulation as occurring on a 
ring of four half-cells. Figure 6 shows the time evolution of cell 
potentials for four neighboring cells in a semi-infinite shift 
register. The shaded areas indicate stored bit information. Each 
cell has the opposite signal to its neighboring cells with a 
quarter period shift; the information is both copied and 
inverted. 

The arrow indicates the direction of the information flow. At 
the end of the first quarter clock period, the first clock is set to 
lowso that the first cell latches the input and locks itwhile the 
second cell is in the null state. By the time the second clock is 
low, the first cell is still kept locked. The second cell thus 
copies the bit from the first cell. By the end of the third quarter 
period, the bit in the first cell is erased as its clock is set to 
high. The third cell copies the bit from the second cell and 
holds it. The process goes on and the bit information is 
transported along the chain. Note that there are always at least 
two copies of the bit at one time. When there are three copies 
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Fig. 5. A four phase clocking scheme in metal-dot QCA. 

V. Operation of Semi-infinite QCA Shift Register 

A. Effect of Temperature and Speed 

Because of the difficulty of fabricating small capacitors, 
metal-dot QCA circuits operate at low temperatures. 




0.5 0.75 I L.Z5 J. 5 

Time/Clock period 

Fig. 6. Time evolution of cell potential in the neighboring cells. Vcell 
(n) is the differential potential between the top and the bottom dot of 
the nth cell. 
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Fig. 8. Deviation from unity power gain for an individual cell as a 
function of temperature 

Thermal excitation is therefore a potential source of random 
error in metal-dot QCA circuits, and it is clear that at high 
enough temperatures the circuit will fail. It is tempting to 
conclude that for a long line of cells, failures are unavoidable at 
any non-zero temperature. It is well known that there is no 
long-range order in one-dimensional systems [15]. 

While the energy for a mistake might be higher than kBT, 
the degeneracy (and therefore entropy) of mistake states 
increases as the system size expands. For a system in thermal 
equilibrium therefore, the free energy of the mistake states 
eventually become lower than the mistake-free zero-entropy 
ground state [7]. A static (unclocked) chain of QCA cells 
therefore has, for any non-zero temperature, a characteristic 
length (_ eEk=kBT) after which mistakes become very likely. 
But a clocked line is not in thermal equilibrium — it is actively 
driven. The clock can supply energy to the system to restore 

8 
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signal states. 

To see the effect of temperature on the performance of the 
clocked semi-infinite shift register, we here solve the time- 
dependent problem of the clocked shift register using the 
master-equation (Eq. 3) approach described in Section II. The 
calculated cell potential (see Fig. 3) of the kth cell in the chain 
at time t is Vcell(k,t). When each cell in the chain in turn 
latches the bit the cell potential is at its largest magnitude. 
Figure 7 shows this maximum cell potential 
Vcell(k)=max(|Vcell(k,t)|) as a function of cell number k down 
the chain. The calculated response is plotted for various values 
of the temperature. The cell potential is higher at the very 
beginning of the chain simply because the first cell is driven by 
an input voltage which is a stronger driver than subsequent 
cells see; they are driven by other cells. At temperatures above 
10 K the cell potential decays with distance as information is 
transported along the chain. At each stage the signal 
deteriorates further, and for a long shift register the information 
will be lost. For individual cells, this means errors due to 
thermal fluctuations become increasingly more likely. As the 
temperature is lowered the signal decay-length increases. At 
temperatures below 5 K, however, the behavior appears 
qualitatively different — the cell potential remains constant 
along the long the chain. 

To the accuracy of our calculation for a large but finite 
number of cells, no signal degradation appears at all. The 
degradation of performance with increasing temperature can be 
explained in terms of power gain. We calculate the power gain 
of each individual cell in the chain by directly calculating the 
work done on the cell by its neighbor to the left, and the work 
done by the cell on its neighbor to the right. For each operating 
temperature the power gain is the same for each cell (apart 
from those very near the beginning of the line). If the power 
gain is precisely 1 (or greater), then there is no signal 
degradation moving down the line. At each cell, power is 
drawn from the clock sufficient to completely restore the signal 
as it is copied to the next cell. We refer to the situation in 
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Fig. 9. The phii.se diagram of the operation space as. a function of temperature and clock period when a. Cj=1.6 aF, and b. Cj=0.16 aF. The shaded area 
below the curve is where the circuit succeeds and the white area is where the circuit fails. 
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Fig. 10. Cell potential as a function of cell number at 4 

which unity power gain enables transmission of signals over 
arbitrarily long distances as Brobust A operation. If the power 
gain is less than 1, then the signal will be degraded as it moves 
down the line. Figure 8 shows the deviation from unity power 
gain as a function of temperature on a logarithmic scale. For 
temperatures below 5 K the power gain is 1; above 5 K, the 
power gain is less than 1. At higher temperatures, the flow of 
energy from the clock can no longer compensate for the energy 
loss to the thermal environment, with the result that the signal 
decays at each stage. As the difference between the power gain 
and 1 becomes small our analysis is limited by the numerical 
accuracy of the calculation. Nevertheless, the exponential 
character of the approach to unity power gain supports the 
interpretation that this transition is a qualitative change 
between robust and non-robust behavior, analogous to a phase 
transition. 

The time-dependent calculation above is repeated for various 
temperatures and clock speeds to generate the phase diagram of 
the operational space of the circuit shown in Fig. 9. We display 
the results for the circuit with our standard parameters, with 
Cj=1.6 aF in Fig. 9a and for more aggressively scaled 
parameters, with Cj=0.16 aF in Fig. 9b. All capacitances and 
voltages in the circuit are scaled appropriately with Cj. The 
aggressively scaled parameter calculation illustrates scalability 
of QCA circuits. 

The performance of the circuit will increase with smaller 
capacitances. The shaded area below the curve indicates speeds 
and temperatures for which the circuit is robust. The white area 
represents non-robust operation for which bit information 
decays along the chain. The two figures are identical except for 
the scale: the aggressively scaled circuit of Fig. 9b operates ten 
times faster and at a temperature ten times higher than the 
circuit in Fig. 9a. The area of robust operation is limited by 
both speed and temperature. In Fig. 9a, when the clock period 
is less than about 0.2 ns (corresponding to 5 GHz), the circuit 
fails (is not robust) even at zero temperature. This occurs as the 
clock period approaches the electron tunneling rate. When the 
clock speed is too fast, the electrons do not have enough time 
to tunnel reliably from one dot to another. The error probability 
accumulates as the information moves along the chain. 



K when capacitance variation is li ■ 10% b ■ 15%. 

Increasing the clock period increases the probability of 
electrons being in the Bright A states. This improvement quickly 
saturates and further increasing the clock period has no effect 
since the electrons have had enough time to be in the correct 
state. 

The tunneling rate is related to the tunnel resistance, so this 
description is equivalent to the observation that the speed is 
limited to the RC time-constant of the circuit. 

B. Defect Tolerance in the QCA Shift Register 

A robust circuit must be tolerant of defects that introduce 
variations in the values of the designed parameters. We 
consider the situation of a very long shift register in which the 
value of each capacitor in the circuit is varied randomly within 
a fixed percentage range from its its nominal value. The circuit 
is robust if the perturbation of the capacitances does not 
influence the performance of the circuit. We choose a working 
point in Fig. 9a where clock period is 5 ns, the temperature is 4 
K, and vary all the capacitances randomly by T10 and T15%. 
Figure 10 shows the cell potential as a function of cell number 
with random capacitance variation. Different color represents 
different capacitance variation within the certain percentage 
range. When the deviation is T10%, the circuit is robust and 
transmits bit information with no errors. The bit information is 
carried on correctly even at the 2,000th cell. When the 
deviation increases to T15%, the circuit is fragile since cells are 
flipped to the wrong states during propagation. This calculation 
demonstrates, again as a result of the power gain in each cell, 
that QCA circuits can tolerate considerable variation in 
parameter values and still function correctly. 



VI. Conclusion 

The QCA approach represents an entirely new way of 
encoding, moving, and processing binary information. As more 
experimental realizations of devices appear, attention naturally 
turns to the broader circuit behavior of these new devices. 
While molecular QCA may represent the most realistic long- 
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term system for robust room temperature operation, the metal- 
dot QCA system provides an extremely valuable prototype 
system in which to explore QCA properties. Metal dot systems 
also have the advantage of being realizable now. 

We have explored here the behavior of metal-dot QCA 
systems under stress — stressed by high temperature operation, 
high speed operation, and random variation in parameter 
values. In each case enough stress destroys the correct 
operation of the circuit. What we observe however is that these 
systems are not terribly fragile, they can survive in a broad 
range of operational space. In each case small errors threaten to 
accumulate over many cells and result in signal loss. The key 
feature is power gain from the clocking circuit which provides 
considerable robustness against these error mechanisms, 
restoring signal levels at each stage 
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Abstract*. Association rule mining is used to find relationships 
among items in large data sets. Frequent patterns mining is an 
important aspect in association rule mining. In this paper, an 
efficient algorithm named Apriori-Feed Forward(AFF) based on 
Apriori algorithm and the Feed Forward Neural Network is 
presented to mine frequent patterns. Apriori algorithm scans 
database many times to generate frequent itemsets whereas 
Apriori-Feed Forward(AFF) algorithm scans database Only 
Once. Computational results show the Apriori-Feed 
Forward(AFF) algorithm performs much faster than Apriori 
algorithm. 

Keywords: Association rule mining, dataset scan, frequent 
itemsets, Neural Network.. 

I. INTRODUCTION 

Data mining has recently attracted considerable attention 
from database practitioners and researchers because it has 
been applied to many fields such as market strategy, financial 
forecasts and decision support [1]. Many algorithms have been 
proposed to obtain useful and invaluable information from 
huge databases [2]. One of the most important algorithms is 
mining association rules, which was first introduced in [3, 
4] .Association rule mining has many important applications in 
our life. An association rule is of the form X => Y. And each 
rule has two measurements: support and confidence. The 
association rule mining problem is to find rules that satisfy 
user-specified minimum support and minimum confidence. It 
mainly includes two steps: first, find all frequent patterns; 
second, generate association rules through frequent patterns. 
Many algorithms for mining association rules from 
transactions database have been proposed [5, 6, 7] since 
Apriori algorithm was first presented. However, most 
algorithms were based on Apriori algorithm which generated 
and tested candidate itemsets iteratively. This may scan 



database many times, so the computational cost is high. In 
order to overcome the disadvantages of Apriori algorithm and 
efficiently mine association rules without generating candidate 
itemsets, many authors developed some improved algorithms 
and obtained some promising results [9,10, 11, 12, 13]. 
Recently, there are some growing interests in developing 
techniques for mining association patterns without a support 
constraint or with variable supports [14, 15, 16] .Association 
rule mining among rare items is also discussed in [17,18]. So 
far, there are very few papers that discuss how to combine 
Apriori algorithm and Neural Network to mine association 
rules. In this paper, an efficient algorithm named Apriori- 
Feed Forward(AFF) based on Apriori algorithm and Feed 
Forward Neural Network is proposed, this algorithm can 
efficiently combine the advantages of Apriori algorithm and 
Structure of Neural Network. Computational results verify the 
good performance of the Apriori-Feed Forward(AFF) 
algorithm. The organization of this paper is as follows. In 
Section II, we will briefly review the Apriori method and Feed 
Forward Neural Network method. Section III proposes an 
efficient Apriori-Feed Forward(AFF) algorithm that based on 
Apriori and the Feed Forward(AFF) structure. Experimental 
results will be presented in Section IV. Section V gives out the 
conclusions. 



II. Classical Mining Algorithm and Neural Network 

A. Apriori Algorithm 

In [4], Agrawal proposed an algorithm called Apriori to the 
problem of mining association rules first. Apriori algorithm is 
a bottm-up, breadth-first approach. The frequent itemsets are 
extended one item at a time. Its main idea is to generate A>th 
candidate itemsets from the (A>l)-th frequent itemsets and to 
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find the A:-th frequent itemsets from the A>th candidate 
itemsets.The algorithm terminates when frequent itemsets can 
not be extended any more. But it has to generate a large 
amount of candidate itemsets and scans the data set as many 
times as the length of the longest frequent itemsets. Apriori 
algorithm can be written by pseudocode as follows. 

Procedure Apriori, Input: data set D, minimum support 
minsup, Output: frequent itemsets L 

(1) 1 L = findfrequentlitemsets(D); 

(2) for (k = 2;Lkl -± (p; k++) 
(3){ 

(4) Ck = Apriori_gen( Lk -1 , minsup); 

(5) for each transactions t _ D 
(6){ 

(7) Ct = subset( Ck , t); 

(8) for each candidate c Ct 

(9) c.count++; 
(10)} 

(11) Lk= {c _Ck\ c. count > minsup}; 

(12) } 

(13) return L = { LI _ L2 _ ... Ln }; 

In the above pseudocode, Ck means A:-th candidate itemsets 
and Lk means A>th frequent itemsets. 

B. Neural Network 

Neural network[ 19,20] is a parallel processing network 
which generated with simulating the image intuitive thinking 
of human, on the basis of the research of biological neural 
network, according to the features of biological neurons and 
neural network and by simplifying, summarizing and refining. 
It uses the idea of non-linear mapping, the method of parallel 
processing and the structure of the neural network itself to 
express the associated knowledge of input and output. 
Initially, the application of the neural network in data mining 
was not optimistic, and the main reasons are that the neural 
network has the defects of complex structure, poor 
interpretability and long training time. But its advantages such 
as high affordability to the noise data and low error rate, the 
continuously advancing and optimization of various network 
training algorithms, especially the continuously advancing and 
improvement of various network pruning algorithms and rules 
extracting algorithm, make the application of the neural 
network in the data mining increasingly favored by the 
overwhelming majority of users. 

C Neural Network Method in Data Mining 

There are seven common methods and techniques of data 
mining[2 1,22,23] which are the methods of statistical analysis, 
rough set, covering positive and rejecting inverse cases, 
formula found, fuzzy method, as well as visualization 
technology. Here, we focus on neural network method. Neural 
network method is used for classification, clustering, feature 
mining, prediction and pattern recognition. It imitates the 
neurons structure of animals, bases on the M-P model and 
Hebb learning rule, so in essence it is a distributed matrix 
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structure. Through training data mining, the neural network 
method gradually calculates (including repeated iteration or 
cumulative calculation) the weights the neural network 
connected. The neural network model can be broadly divided 
into the following three types: 

(1) Feed-forward networks: it regards the perception back- 
propagation model and the function network as 
representatives, and mainly used in the areas such as 
prediction and pattern recognition; 

(2) Feedback network: it regards Hopfield discrete model and 
continuous model as representatives, and mainly used for 
associative memory and optimization calculation; 

(3) Self-organization networks: it regards adaptive resonance 
theory (ART) model and Kohonen model as representatives, 
and mainly used for cluster analysis. 



D. Feedforward Neural Network : 

Feedforward neural network (FF network) are the most 
popular and most widely used models in many practical 
applications. They are known by many different names, such 
as "multi-layer perceptrons." 

Figure 2(a) illustrates a one-hidden-layer FF network with 

inputs X I ,..., x n and output J> . Each arrow in the figure 
symbolizes a parameter in the network. The network is divided 
into layers. The input layer consists of just the inputs to the 
network. Then follows a hidden layer, which consists of any 
number of neurons, or hidden units placed in parallel. Each 
neuron performs a weighted summation of the inputs, which 

then passes a nonlinear activation function^, also called the 
neuron function. 
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Figure 2(a) A feedforward network with one hidden layer and 
one output. 

Mathematically the functionality of a hidden neuron is 
described by 



£ Wj ^ + b j 



:=i 



where the weights { W J ,J} are symbolized with the arrows 
feeding into the neuron.The network output is formed by 
another weighted summation of the outputs of the neurons in 
the hidden layer. This summation on the output is called the 
output layer. In Figure 2(a) there is only one output in the 
output layer since it is a single-output problem. Generally, the 
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number of output neurons equals the number of outputs of the 
approximation problem. The neurons in the hidden layer of the 
network in Figure 2(a) are similar in structure to those of the 
perception , with the exception that their activation functions 
can be any differential function. The output of this network is 
given by 
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Figure 2(b) 



V" 1 f n 



1=1 



:=i 



+ b^ 



where n is the number of inputs and nh is the number of 
neurons in the hidden layer. The variables { Wi *J , J**, w i & } are 
the parameters of the network model that are represented 
collectively by the parameter vector 6 . In general, the neural 
network model will be represented by the compact notation 
g( e ,x) whenever the exact structure of the neural network is not 
necessary in the context of a discussion. Note that the size of 
the input and output layers are defined by the number of inputs 
and outputs of the network and, therefore, only the number of 
hidden neurons has to be specified when the network is 
defined. The network in Figure 2(a) is sometimes referred to 
as a three-layer network, counting input, hidden, and output 
layers. However, since no processing takes place in the input 
layer, it is also sometimes called a two-layer network. To 
avoid confusion this network is called a one-hidden-layer FF 
network throughout this documentation. 

In training the network, its parameters are adjusted 
incrementally until the training data satisfy the desired 
mapping as well as possible; that is, until ? (°) matches the 
desired output y as closely as possible up to a maximum 
number of iterations. 

The nonlinear activation function in the neuron is usually 

chosen to be a smooth step function. The default is the 

standard sigmoid 

1 
5igmoid[x] 



1 + e- K 
that looks like this. 

«NeuraIHet™rks' 

Plot [Sigmoid [x], {x, -8, 8}] 



III. Apriori- Feedforward Algorithm(AFF) 

In this Section, a new algorithm based on Apriori and the 
Feedforward Neural Network structure is presented, which is 
called Apriori- Feedforward Algorithm(AFF).Fig.3a shows the 
database structure in which there are different Item Ids and 
sets of Items purchased against Item ID and Fig 3(a) 
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Fig 3(a): Item Id and Item List of Database 
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FIG 3(B): Data Structure of Nodes in FeedForward Neural 
Network 

The Apriori- Feedforward algorithm mainly includes two 
steps. 

First, a neural network model is prepared according to the 
maximum number of items present in the dataset. Then first 
transaction of the data set is scanned to find out the frequent 1 
itemsets, and then neural network is updated for frequent 2 
itemsets frequent 3 itemsets and so on. The data set is scanned 
only once to build all frequent combinations of datasets. 
While updating frequent 2/frequent 3 itemsets..., its pruning is 
done at the same time to avoid redundancy of item sets. At 
last, the built Neural Network is; 9 mined by Apriori- 
FeedForward Algorithm. The detailed Apriori-FeedForward 
Algorithm is as follows. 

Procedure : Create_Model 

Input: data set D, minimum support minsup 

Output: 

(1) procedure CreateModel(n) 

(2)for(i=l;i^ ;i++) 

(3) for each itemset lj G l k _! 

(4) for each itemset 1 2 E l k _! 

(5) if( Ml] = 1 2 [1]) ( \ x [2\ = 1 2 [2]) ( l x [3] = 1 2 [3]) 
(li[n] = l 2 [n]) 
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(6) then 

(7) C = l lX l 2 

(8) if already_generated(l 1 x 1 2 ) then 

(9) delete C 

(10) else add C to C k 
(ll)FNN(C k ) 

Procedure FNN(C k ) 
Input: itemset Model C k 
Output: frequent itemsets L 

(1) procedure FNN(C k ) 

(2) n = recordcount(Dataset) 
(3)for(i=l;i<n;i++) 

(4){ 

(5) Li = getfirsttransaction(Dataset) 

(6) upf = update_frequecy(l! x 1 2 ) 

(7) if (upf >= minsup) 

(8) print(li x 1 2 ) 

In this algorithm a complete Feed forward neural network is 
prepared according to the maximum number of items present 
in the datasets. First layer of the network is a frequent 1 
itemsets second layer is frequent 2 item sets and so on until a 
final layer is prepared which is a single node comprising of all 
items present in the datasets. Every n+1 layer is combination 
of item n with respect to all other items present at that layer, 
these layers are generated by calculating factorial of n+1 
items. 
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Fig.4(b). 
From Fig.4(a), 4(b). we can make the following two 
statements. First, Apriori- FeedForward algorithm works much 
faster than Apriori. It uses a different method FNN to calculate 
the support of candidate itemsets and it consumes less memory 
than Apriori because it doesn't need to traverse database again 
and again. It needs only single scan to the database. 

v. Conclusion and future work 

In this paper, we have proposed the Apriori- FeedForward 
algorithm. This method builds Feed Forward Neural Network 
Model and scans the data base only once to generate frequent 
patterns. The future work is to further improve the Apriori- 
Feed Forward algorithm and test more and larger datasets. 



iv. Experimental Results 

The content of our test data set are frequently purchased items 
of a super market. There are 7 to 12 different items and 10000 
to 50000 records in that data set. In order to verify the 
performance of the Apriori - FeedForward algorithm, we 
compare Apriori-Feed Forwrd with Apriori. The algorithms 
are performed on a computer with i7 processor 1.60GHz and 4 
GB memory. The program is developed by NetBeans 6.8. The 
computational results of two algorithms are reported in Table 
l.The clearer comparison of two algorithms is given in 
Fig.4(a). Table 1. The running time of two 

algorithms Apriori - FeedForward algorithm 



Min.Supp 


Apriori 


AFF 


30% 


15000ms 


1762ms 


25% 


15000ms 


1545ms 


20% 


15000ms 


1529ms 


15% 


19000ms 


1682 ms 


10% 


20000ms 


1634 ms 


5% 


30000ms 


1625ms 



Fig.4(a).Table 1. 
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Abstract — With the growth of internet and multimedia, 
compression techniques have become the thrust area in the fields of 
computers. Image compression is a technique of efficiently coding 
digital image to reduce the number of bits required in representing 
image. Many image compression techniques presently exist for the 
compression of different types of images. In this paper Vector 
Quantization based compression scheme is introduced. In this 
scheme a low bit rate still image compression is performed by 
compressing the indices of Vector Quantization and residual 
codebook is generated. The indices of VQ are compressed by 
exploiting correlation among image blocks, which reduces the bit per 
index. A residual codebook similar to VQ codebook is generated that 
represents the distortion produced in VQ. Using this residual 
codebook the distortion in the reconstructed image is removed, 
thereby increasing the image quality. The proposed technique 
combines these two methods and by replacing the Modified k-means 
algorithm for LBG in the codebook generation. Experimental results 
on standard image Lena show that the proposed scheme can give a 
reconstructed image with a higher PSNR value than all the existing 
image compression techniques. 

Keywords — Image compression, Vector Quantization, Residual 
Codebook, Modified K-Means 



I. 



Introduction 



IMAGE compression is a method of efficiently coding 
digital image, to reduce the number of bits required in 
representing image. Its purpose is to decrease the storage 
space and transmission cost while maintaining good quality. 
VECTOR Quantization [1] has been found to be an efficient 
technique for image compression in the past decade. VQ 
compression system mainly contains two components: VQ 
encoder and decoder as shown in Fig.l. 

In VQ technique [2] [3], the input image is partitioned into 
a set of non-overlapping image blocks X = {x^.x l .,..,x^_{l 

of size 4x4 pixels each and a clustering algorithm, for example 
Linde-Buzo-Gray (LGB) algorithm [5] and Modified K- 
Means [2]. The Modified K-Means algorithm is used in the 
proposed technique, to generate a codebook 
C = [Y^Y*. V.v-i- f° r ^e given set of image blocks. The 

codebook C comprises a set of representative image blocks 
called codewords. The VQ encoder discovers a closest match 
codeword in the codebook for each of the image block and the 
index of the codeword is transmitted to VQ decoder. The 
decoder phase has the following functionalities. VQ decoder 
replaces the index values with the respective codewords from 



the codebook and produces the quantized image, called as 
reconstructed image. In order to attain low bit rate, many VQ 
schemes, have been used in the past literature such as side- 
match VQ (SMVQ) [6], classified SMVQ (CSMVQ) [7] and 
Gradient based SMVQ (GSMVQ) [8]. 



Image Blocks 





Codebook 

C={ci,c 2 ..c n } 




LBG /Modified 
K-Means 




^ 


VQ 


Indices 


w 


Encoder 


w 



(a) 



Codebook 

C={co,c 2 ..c N _i] 



LBG /Modified 
K-Means 



Indices 



VQ 

Decoder 



Reconstructed Image 

► 



(b) 



Fig. 1 (a) VQ Encoder (b) VQ Decoder 

Even though, SMVQ, CSMVQ, GSMVQ and JSPVQ are 
the low bit rate schemes, they require high encoding time than 
that of VQ method. In this paper, an efficient low bit rate 
image compression scheme is proposed based on VQ that 
makes use of compression of indices of VQ and residual 
codebook with modified k-means clustering instead of LGB. 
This scheme attains low bit rate and better image quality than 
previous VQ methods. 

The rest of the paper is presented as follows: in section II, 
the literature survey is presented. In section III the proposed 
compression scheme is described. Performance of the 
proposed system is evaluated in section IV and section V 
concludes the paper. 
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II. 



Literature Survey 



Somasundaram et al, [1] presented a novel vector 
quantization approach and also explained various VQ 
schemes, such as side-match VQ (SMVQ), classified SMVQ 
(CSMVQ) and Gradient based SMVQ (GSMVQ). In which 
SMVQ uses the high correlation existing between neighboring 
blocks to achieve low bit rate and master codebook C is used 
to encode image blocks in the first column and first row in 
advance. The other image blocks are encoded, utilizing the 
correlation with the neighboring encoded image blocks. Let x 
be the input image block for the compression system, and u 
and 1 be the upper and left neighboring codewords 
respectively. Let the size of the given image block size be k = 
mx n. The side-match distortion of a codeword Y can be 
defined as: 

v V ^ (1) 

According to their side-match distortions of all codewords 
SMVQ sorts the codewords and then selects N s codewords 
with smallest side-match distortions from the master book C 
of size N to form the state codebook SC, where N s < N. A 
best-match codeword Yj is selected to encode an image block 
x from N s codewords and the corresponding index is coded in 
log 2 N s bits. Thus, the SMVQ reduces the bit rate of VQ. Since 
mean square error caused by state codebook is higher than that 
of master codebook, SMVQ degrades the image quality and 
also it requires long encoding time. Classified side-match 
vector quantization [7] (CSMVQ) is an efficient low bit rate 
image compression technique which produces relatively high 
quality image. It is a variable rate SMVQ and makes use of 
variable sized state codebooks to encode the current image 
block. The size of the state codebook is decided based on the 
variances of left codewords and upper codewords that predict 
the block activity of the input blocks. Also, CSMVQ uses two 
master codebooks, one for low detail blocks and another for 
high detail blocks. Another variant, gradient-based SMVQ [8] 
(GSMVQ) has been proposed, in which gradient values are 
used instead of variance values to predict the input vector. 
Another low bit rate VQ, called Jigsaw-puzzle vector 
quantization (JPVQ) [9] was proposed,in which an input block 
can be coded by the super codebook, the dynamic codebook or 
the jigsaw-puzzle block. The jigsawpuzzle block is 
constructed dynamically using four- step side match prediction 
technique. 

Interpolative vector quantization, first proposed explicitly 
by Gersho in [12], introduces dimension reduction to 
traditional VQ. The codebook in the encoder is learned on 
downsampled vectors and the codebook in the decoder on 
high-dimension vectors. Except for the difference on 
dimension, the two codebooks have the same number of 
representative vectors and structure. VQ encoder maps down 
the sampled inputs to a set of scalar indices and VQ decoder 
reproduces high-dimension inputs by received indices. David 



et al. applied IVQ to image restoration [13], where the encoder 
does not need a codebook except for some parameters. The 
codebook at decoder is learned on image pairs consisting of an 
original image and its diffraction-limited counterpart. Several 
follow-up work is reported in [14] [15]. 

The goal of quantization is to encode the data from a 
source, with some loss, so that the best reproduction is 
obtained. Vector quantization (VQ) achieves more 
compression then scalar quantization [14], making it useful for 
band-limited channels. The algorithm for the design of optimal 
VQ is commonly referred to as the Linde-Buzo-Gray (LBG) 
algorithm, and it is based on minimization of the squared-error 
distortion measure. The LBG algorithm starts with an initial 
codebook and iteratively partitions the training sequence into 
the Voronoi regions to obtain a new codebook that produces a 
lower distortion. Once the final codebook is got, it can be used 
on new data outside the training sequence with the optimum 
nearest neighbor rule. If the training sequence is sufficiently 
long, it yields good performance for future data produced by 
the source. 

In the paper by M.Antonini, et al. [18], images have been 
coded using two-level wavelet decomposition with VQ of the 
resulting coefficients. A multiresolution codebook has been 
designed with noise- shaping bit allocation among the various 
subbands. The test image has been coded at a rate of 0.78bpp, 
achieving a PSNR of 32.1dB. In the paper by Gersho and 
Ramamurthy [19], images have been compressed using 
unstructured VQ, achieving a bitrate of 0.5 - 1.5bpp. Ho and 
Gersho [19] have used multistage VQ for progressive image 
coding, with a PSNR of 30.93dB at 0.36bpp using 4 stages. 
R.L.Claypoole et al. [21] have coded images using nonlinear 
wavelet transform via lifting and obtained 30.5dB at 0.6bpp. 
An adaptive lifting scheme with perfect reconstruction has 
been used in [21]. 



III. 



Methodology 



The compression scheme consists of two components, 
compression of indices and generation of residual codebook. 
These two are explained in this section. 

3.1. Compression of Indices 

The Index compression step has the following procedure. 
When the image blocks are vector quantized, there likely to 
exist high correlation among the neighboring blocks and hence 
among the corresponding codeword indices. Therefore, if 
indices are coded by comparing with the previous indices, 
further reduction in the bit rate can be achieved. In Search 
Order Coding (SOC) [11], a simple searching scheme is 
followed to find a match for the current index from the 
previous indices. The search order SO is defined as the order 
in which the current index is compared with the previous 
indices. 
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Fig. 2 Searching Order 

The SO used in [11] is given in Fig.2. The label "1" 
indicates the highest searching priority, "2" denotes the second 
highest priority and so on. In order to limit comparisons of the 
current index with previous indices, the searching range (SR) 
is fixed. The SR is defined as the number of previous indices 
to be compared with current index. In this paper Search Order 
[11], SR is taken as 10, which gives the lower bit rate than 
other SR. 

In this method, the index of the codeword of a block is 
encoded exploiting the degree of the similarity of the block 
with previously encoded upper or left blocks. When the degree 
of similarity of the current block with one of the two 
previously encoded blocks is high, the index of the codeword 
of the block is encoded using the index of the neighboring 
codeword. I.e. the codeword index of the current block and 
that of the neighboring blocks are same. If the degree of 
similarity of the block with the neighboring blocks is not high, 
it is assumed that the closest match codeword of the current 
block may be nearer to the codewords of the neighboring 
blocks. For example, if one of the two neighboring blocks 
codeword's index is 'N', the closest match codeword of the 
block to be encoded may lie between (N-J) th codeword and 
(N+J) th codeword in the codebook, where J is any arbitrary 
number. So the index can be coded in log 2 ( 2*J ) bits. This 
idea is based on the property existing in the codebook design 
using LBG algorithm with splitting technique. In the splitting 
technique, bigger size codebook is generated by splitting each 
codeword of the smaller codebook into two. The size of the 
codebook is always in powers of two (2 M — » 2 (M+1) ). Hence, 
relatively similar two image blocks may have same closest 
match codeword in Jth position at codebook of size 2 M and at 
codebook of size 2 (M+1) , one of the two image blocks may have 
its closest match codeword at Jth place in the codebook and 
other block's codeword may be in (J+l)th place. The other 
non- similar blocks are encoded using their original index 
value. In this scheme, examining the resemblance of a block 
with its left and upper blocks is not required to encode the 
index of the block. The above description is the idea behind 
our VQ indices compression scheme. In order to implement 
this idea, the index to be coded is compared with previous 
indices according to the SO given in Fig.2 and SR is fixed as 2 
in this scheme. Let 1, 2,.., 12 be the SO and ind_val (1), 
ind_val (2),..ind_val(12) be the indices values of the SO = 
1,2,. . . 12. The following steps is used to encode VQ index. 



1 Get the first index generated by the VQ encoder and transmit 
as such. 

2. Get the next index generated by VQ Encoder. Compare this 
index with the previous indices according SO 

3. if SO = 1, code it as "00" and go to the step 2 

else 

if SO = 2, code it as "01" and go to the step 2 
else go to the next step. 
4 if index value < (ind_val (SO = 1) + J) and 
index value > — (ind_val (SO = 1)+J) 
{ if ind_val (SO =1) = ind_val (SO=2) 
code it as "10" followed by log (2 * ) 2 J bits 
else 

code it as "100" followed by log (2 * ) 2 J 
bits. } 

go to step 2. 
else 

if index value < (ind_val (SO = 2) + J) and 
index value > - (ind_val (SO = 2)+J) 
code it as "101" followed by log (2 * ) 2 J bits 
and go to step 2. 
else 

code it as "11" followed by its original index and 
goto step 2. 

Decoding of the compressed indices is done by reversing the 
above coding steps. 

3.2. Construction of Residual Codebook (RC) 

The residual codebook can be represented 
as JK = {ffir , . J Jtt f ', 1 . 1 ..ffl^_ : }. Residual codebook is 

constructed using absolute error values caused by VQ method, 
in the residual codebook construction, the image blocks that 
are less similar to their closest match codewords found in the 
codebook are taken into account. Less similarity blocks will 
increase distortion than high similarity blocks in the 
reconstructed image. Residual codeword (RYj) for a less 
similarity image block is constructed by comparing it with its 
closest match codeword. The collection of residual codewords 
RYj, RY i+ i... is called residual codebook. Similarity of an 
image block x with its closet match codeword Yj is determined 
based on minimum distortion rule (a) between them. If the 
mean square error (a) of an image block is greater than a 
predefined threshold value (o), then the block is taken as less 
similarity block. 



Let t; = Qr 2 



nXfc. 



. be a k-pixels image block and 



Y t = (y; ->■*.■ — *}\ J ^ e a k-pixels closest match codeword, 

(2) 



then the a is defined as: 



=£>'-«*■ 



The steps used for constructing residual codebook are 
given below. 
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Step 1: An image which is to be compressed is 
decomposed into a set of non-overlapped image blocks of 4x4 
pixels. 

Step 2: A codebook is generated for the image blocks 
using LBG algorithm. 

Step 3: Pick up the next codeword Y t from the codebook 
C and find its all closest match less similarity image blocks(X) 
found out using (2) from the given set of image blocks and 
construct residual codeword RY t using the following equation. 

: v . . (3) 

-. 

where k represents the number of elements in the 
codeword Y t and the image block Xi respectively and m 
denotes the number of less similarity image blocks that are 
closer to the codeword Y t . 

Repeat the step 3 until no more codeword exists in the 
codebook. Since residual codeword RYj is constructed only 
for less similarity image blocks, some of the codewords Yj 
may not have their respective residual codewords, i.e; these 
codewords may not have less similarity image blocks. In 
residual codebook construction, only absolute values of the 
residuals of the less similarity image blocks are used. The sign 
information for each less similarity image block is preserved 
and is called residual sign bit plane. In encoding phase, for 
each less similarity image block, pixels of the block are 
subtracted from the corresponding pixel values of the 
codeword Y i? then sign values (positive or negative) of the 
residual values of that block, called residual sign bit plane, are 
preserved. To reduce the bits needed for residual sign bit 
plane, only alternate bits are stored and others are dropped 
based on the assumption that there exists correlation among 
neighboring bits. The bits used for prediction is shown in 
Fig. 3. In the decoding process, the bits of the residual sign bit 
plane of a block are replaced with the respective residual 
values of the residual codeword from the residual codebook 
(RC) with appropriate sign. The residual values of the dropped 
bits are predicted from neighboring residual values using 
following steps. 



\pv\S) = 






-LpL-fc) = 



rruiA) + rr v{C) -£- tTL?(> ) 



rruUD-h rrvjji) 



rrvKA} +■ ttv{I) +■ m?{F) 



ttpW) fm?(0^ rrviF) +■ m(K} 






rrvU) +■ rrvVi) +■ rru (JF) ■+■ m m (K) 



rrvlji) -h rrviJO — rrv{P} 






>■>-,- ;: _ w L - ;■;■ 



m?(*V; +■ rrviK) + rri?(P) 



where pv (*) is the pre assigned value of the corresponding 
bit in the residual sign bit plane and rrv (*) is the respective 
reconstructed residual value of the bit in the residual sign bit 
plane. 

© * © » 
i © * © 
j ® L 
* © ° ® 

Fig. 3 Bits encircled are used for prediction 

After reconstructing the residual codeword, each value of 
the residual codeword is added to respective value of the 
closest match codeword of the block Since the residual sign 
bit plane for each image block has only eight bits, alternate 
residual values in the residual codeword RY t are dropped and 
it also reduces the cost of storing residual codebook. The 
dropped residual values are predicted from the neighboring 
residual values as given above. 

3.3. Modified K-Means Algorithm to Replace LBG 
Initial Cluster Centers Deriving from Data Partitioning 

The algorithm follows a novel approach that performs data 
partitioning along the data axis with the highest variance. The 
approach has been used successfully for color quantization [9]. 
The data partitioning tries to divide data space into small cells 
or clusters where intercluster distances are large as possible 
and intracluster distances are small as possible. 

For instance, Suppose ten data points in 2D data space are 
given. The goal is to partition the ten data points into two 
disjoint cells where sum of the total clustering errors of the 
two cells is minimal. Suppose a cutting plane perpendicular to 
X-axis will be used to partition the data. Let Ci and C 2 be the 
first cell and the second cell respectively and £ : and C : be the 

cell centroids of the first cell and the second cell, respectively. 
The total clustering error of the first cell is thus computed by: 

Z *m> (4) 

-L EC t 

and the total clustering error of the second cell is thus 
computed by: 

Where q is the i th data in a cell. As a result, the sum of total 
clustering errors of both cells is minimal. 

The partition could be done using a cutting plane that passes 
through m. Thus 
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d(.Ci.£ t }< dCcj,c TC ) + oCci-fip-3 onrf 



dCc 1B c 2 } < dCcvj tff.) + dfc;* c m ) 



Thus 



(6) 






^.EC- 



HI is called as the partitioning data point where IC1I and IC2I 
are the numbers of data points in cluster CI and C2 
respectively. The total clustering error of the first cell can be 
minimized by reducing the total discrepancies between all data 
in first cell to m, which is computed by: 

I '(ceo (8) 

The same argument is also true for the second cell. The total 
clustering error of second cell can be minimized by reducing 
the total discrepancies between all data in second cell to m, 
which is computed by: 



d l c i* c m} 



(9) 



f.EC- 



where d(c i9 c m ) is the distance between m and each data in 
each cell. Therefore the problem to minimize the sum of total 
clustering errors of both cells can be transformed into the 
problem to minimize the sum of total clustering error of all 
data in the two cells to m. 

The relationship between the total clustering error and the 
clustering point is illustrated in Fig. 4, where the horizontal- 
axis represents the partitioning point that runs from 1 to n 
where n is the total number of data points and the vertical-axis 
represents the total clustering error. When m=0, the total 
clustering error of second cell equals to the total clustering 
error of all data points while the total clustering error of first 
cell is zero. On the other hand, when m=n, the total clustering 
error of the first cell equals to the total clustering error of all 
data points, while the total clustering error of the second cell is 
zero. 



— 




Fig. 4 Graphs depict the total clustering error, lines 1 and 2 represent 

the total clustering error of the first cell and second cell, respectively, 

Line 3 represents a summation of the total clustering errors of the 

first and the second cells 

A parabola curve shown in Fig. 4 represents a summation of 
the total clustering error of the first cell and the second cell, 
represented by the dash line 2. Note that the lowest point of 
the parabola curve is the optimal clustering point (m). At this 
point, the summation of total clustering error of the first cell 
and the second cell are minimum. 

Since time complexity of locating the optimal point m is 
0(n 2 ), the distances between adjacent data is used along the X- 
axis to find the approximated point of n but with time of O(n). 

Let D- = d^pC-^ 2 be the squared Euclidean distance of 

adjacent data points along the X-axis. 

If i is in the first cell then c(cv. c < Z-V -?-. On the one 

hand, if i is in the second cell then d[c^ t £ : ) < Y.T =m Dj 

The task of approximating the optimal point (m) in 2D is 
thus replaced by finding m in one-dimensional line. 
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Fig. 5 Illustration of the ten data points on a one-dimensional 
line and the relevant Dj 

The point (m) is therefore a centroid on the one dimensional 
line (as shown in Fig. 5), which yields 
K ^ (10) 

: = L :=n 

Let risunig = £ : r=1 D- and a centroidDist can be computed 



cmtroidDis: = 



ilL^sami 



(id 



Partitioning Point 



It is probable to choose either the X-axis or Y-axis as the 
principal axis for data partitioning. However, data axis with 
the highest variance will be chosen as the principal axis for 
data partitioning. The reason is to make the inter distance 
between the centers of the two cells as large as possible while 
the sum of total clustering errors of the two cells are reduced 
from that of the original cell. To partition the given data into k 
cells, it is started with a cell containing all given data and 
partition the cell into two cells. Later on the next cell is 
selected to be partitioned that yields the largest reduction of 
total clustering errors (or Delta clustering error). This can be 
described as Total clustering error of the original cell - the 
sum of Total clustering errors of the two sub cells of the 
original cell. This is done so that every time a partition on a 
cell is performed, the partition will help reduce the sum of 
total clustering errors for all cells, as much as possible. 
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The partitioning algorithm can be used now to partition a 
given set of data into k cells. The centers of the cells can then 
be used as good initial cluster centers for the K-means 
algorithm. Following are the steps of the initial centroid 
predicting algorithm. 

1 . Let cell c contain the entire data set. 

2. Sort all data in the cell c in ascending order on each 
attribute value and links data by a linked list for each attribute. 

3. Compute variance of each attribute of cell c. Choose an 
attribute axis with the highest variance as the principal axis for 
partitioning. 

4. Compute squared Euclidean distances between adjacent 
data along the data axis with the highest variance 
D- = d(c-.c-_;) : and compute the dram- = ^\ = ^D l 

5. Compute centroid distance of cell c: 

cvntrvitiDis: = — — - 

n. 

Where dsumi is the summation of distances between the 
adjacent data. 

6. Divide cell c into two smaller cells. The partition 
boundary is the plane perpendicular to the principal axis and 
passes through a point m whose dsumi approximately equals 
to centroidDist. The sorted linked lists of cell c are scanned 
and divided into two for the two smaller cells accordingly 

7. Calculate Delta clustering error for c as the total 
clustering error before partition minus total clustering error of 
its two sub cells and insert the cell into an empty Max heap 
with Delta clustering error as a key. 

8. Delete a max cell from Max heap and assign it as a 
current cell. 

9. For each of the two sub cells of c, which is not empty, 
perform step 3 - 7 on the sub cell. 

10. Repeat steps 8-9. Until the number of cells (Size of 
heap) reaches K. 

1 1 . Use centroids of cells in max heap as the initial cluster 
centers for K-means clustering 



5. if (a <o ), the current block is encoded as "0". 
else 

the current block is encoded as "1" followed by 
interpolated residual sign bitplane which is computed as 
described in section 3.2. 

6. Repeat the step 4 until no more blocks exist in the 
image. 

The decoding of the compressed images is done by reversing 
the above said steps and residual block to be added is 
reconstructed for each less similarity block as described in 
section 3.2. 

IV . Experimental Results 

To evaluate the proposed technique experiments are carried 
out on standard gray scale images using a Pentium-IV 
computer running at 1.60 GHz under Windows XP. Three 
images of 512 x 512 pixels in size are used. Codebook is 
generated using Modified K-Means algorithm for all the 
methods. Codebook is also generated with LBG [5] for 
comparison. For this scheme, a codebook of size 64 is used. 
Performances of the above algorithms are evaluated in terms 
of bit rate (bits per pixel) and peak signal-to-noise ratio 
(PSNR) given by: 



PSXR = 10 log ir ■ 



USE 



■c& 



where MSE (mean squared error) is defined as: 



™-2]<"-ji' 



where xi and y A denote the original and the encoded pixel 
values and n is the total number of pixels in an image. Bit rate 
including overhead bits (i.e bits need to store codebook) for 
different threshold values ranging from 50 to 2000 for Lena, 
Camera man and Pepper. 



3.4. The Proposed Algorithm 

The proposed scheme combines compression of VQ 
indices and residual codebook. The steps used in this 
compressor are as follows 

1. An image which is to be compressed is decomposed 
into a set of non-overlapped image blocks of size 4x4 pixels. 

2. A codebook is generated for the image blocks using 
Modified K-Means algorithm. 

3. Construct a Residual Codebook (as described in section 
3.4) for those image blocks (less similarity blocks) whose a is 
greater than o . 

4. Pick the next image block (current block) and find its 
closest match codeword in the codebook. Calculate mean 
square error a for the image block using equation (2) and 
index of the codeword is encoded using VQ indices 
compression scheme presented in section 3.1. 



The performance of proposed scheme is evaluated with 
the existing techniques for different gray- scale images of size 
512x512 and is given in the table I. J is set to 4 for the 
proposed scheme. From table I, it can be observed note that 
proposed method with the modified k-means instead of LBG 
has an improvement in coding the VQ indices. 

Table i 

Performance of proposed method vq with codebook 

size 64 using lbg, k-means and modified k-means ln 

coding standard gray scale images of size 512x512 

EACH 



Images 


VQ 

bits/index 


LBG 

(bits/index) 


K-Means 

(bits/index) 


Modified 
K-Means 

(bits/index) 


Lena 


6 


3.92 


3.88 


3.47 


Cameraman 


6 


4.08 


3.99 


3.32 


Peppers 


6 


3.72 


3.66 


3.20 
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Table II shows the comparison of the PSNR values for the 
Lena, Camera man and Pepper images of 512x512 bits when 
compressed with combine VQ method and Codebook using 
LBG, K-Means and Modified k-Means algorithm. 



Table II 
Comparison of PSNR values for three standard 
images 



Images 


VQ 

bits/index 


LBG 
(dB) 


K- 

Means 
(dB) 


Modified 

K-Means 

(dB) 


Lena 


6 


31.60 


32.51 


34.65 


Cameraman 


6 


30.24 


31.25 


33.84 


Peppers 


6 


31.44 


32.26 


34.20 



37 
35 



LBG(dB) 
K-Means (dB) 
Modified K-Means (dB) 




Lean 



Camem Mnn 
Images 



Pqjpers 



Fig. 6 Comparison of PSNR values for three standard images 
From Fig. 6 it is observed that this scheme gives PSNR 
values of 34.65db, 33.84db and 34.20 db for Lena, Camera 
man and Peppers respectively. From this it can be observed 
that the proposed approach produces better result than all the 
existing methods. 

V. Conclusion 

The rapid increase in the range and use of electronic 
imaging justifies attention for systematic design of an image 
compression system and for providing the image quality 
needed in different applications. There are a lot of techniques 
available for image compression. In this paper, a new gray 
scale image compression scheme is proposed which gives 
better image quality and low bit rate. This scheme is based on 
VQ method and employs residual codebook to improve image 
quality and compression of VQ indices to lower the bit rate. 
Experimental results on standard images show that the 
proposed scheme gives better PSNR values and low bit rate 
than previous methods with codebook generation using LBG 
and kmeans. Since this scheme uses smaller codebook, it gives 
faster compression than the other two schemes. 
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Abstract — Data Warehouses (DW) typically grows 
asynchronously, fed by a variety of sources which all serve a 
different purpose resulting in, for example, different reference 
data. ETL is a key process to bring heterogeneous and 
asynchronous source extracts to a homogeneous environment. 
The range of data values or data quality in an operational system 
may exceed the expectations of designers at the time validation 
and transformation rules are specified. Data profiling of a source 
during data analysis is recommended to identify the data 
conditions that will need to be managed by transformation rules 
and its specifications. This will lead to implementation of the ETL 
process. Extraction-Transformation-Loading (ETL) tools are set 
of processes by which data is extracted from numerous 
databases, applications and systems transformed as appropriate 
and loaded into target systems - including, but not limited to, 
data warehouses, data marts, analytical applications, etc. Usually 
ETL activity must be completed in certain time frame. So there is 
a need to optimize the ETL process. A data warehouse (DW) 
contains multiple views accessed by queries. One of the most 
important decisions in designing a data warehouse is selecting 
views to materialize for the purpose of efficiently supporting 
decision making. Therefore heuristics have been used to search 
for an optimal solution. Evolutionary algorithms for materialized 
view selection based on multiple global processing plans for 
queries are also implemented. The ETL systems work on the 
theory of random numbers, this research paper relates that the 
optimal solution for ETL systems can be reached in fewer stages 
using genetic algorithm. This early reaching of the optimal 
solution results in saving of the bandwidth and CPU time which it 
can efficiently use to do some other task. Therefore, the proposed 
scheme is secure and efficient against notorious conspiracy goals, 
information processing. 

Keywords- Extract, Transform, Load, Data Warehouse(DW), 
Genetic Algorithm (GA), Architecture, Information Management 
System, Virtual Storage Acess Method and Indexed Sequential 
Access Method 



I. 



Introduction 



Companies know they have valuable data lying around 
throughout their networks that needs to be moved from one 
place to another — such as from one business application to 
another or to a data warehouse for analysis. The only problem 
is that the data lies in all sorts of heterogeneous systems and 
therefore in all sorts of formats. To integrate data to one 
warehouse for analysis a tool is required which can integrate 
data from various systems. To solve the problem, companies 
use extract, transform and load (ETL) software. Usually ETL 



activity must be completed in certain time frame. So there is a 
need to optimize the ETL process. Typical ETL activity 
consists of three major tasks: extraction, transformation and 
loading. 

This research paper is the study of extraction and 
transformation stages. Data extraction can be seen as reader 
writer problem, which has been reformulated using multiple 
buffers instead of using single finite buffer. Transformation is 
set of activities which convert the data from one form to other. 
This thesis studies the use of Genetic Algorithm to optimize 
the ETL workflow. 

A Basic concepts of ETL 

a) Extract 

The first part of an ETL process involves extracting the data 
from the source systems. Most data warehousing projects 
consolidate data from different source systems. Each separate 
system may also use a different data organization/format. 
Common data source formats are relational databases and flat 
files, but may include non-relational database structures such 
as Information Management System (IMS) or other data 
structures such as Virtual Storage Access Method (VSAM) or 
Indexed Sequential Access Method (ISAM), or even fetching 
from outside sources such as web speeding or screen-scraping. 
Extraction converts the data into a format for transformation 
processing. An intrinsic part of the extraction involves the 
parsing of extracted data, resulting in a check if the data meets 
an expected pattern or structure. If not, the data may be 
rejected entirely. 

b) Transform 

The transform stage applies to a series of rules or functions 
to the extracted data from the source to derive the data for 
loading into the end target. Some data sources will require 
very little or even no manipulation of data. In other cases, one 
or more of the following transformations types to meet the 
business and technical needs of the end target may be 
required: 

• Selecting only certain columns to load (or selecting 
null columns not to load). 

• Translating coded values (For example, if the source 
system stores 1 for male and 2 for female, but the 
warehouse stores M for male and F for female), this 
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calls for automated data cleansing; no manual 
cleansing occurs during ETL. 

• Encoding free-form values (For example, mapping 
"Male" to "1" and "Mr" to M). 

• Deriving a new calculated value (For example, 
saleamount = qty * unit_price). 

• Filtering. 

• Sorting. 

• Joining data from multiple sources (For example, 
lookup, merge). 

• Aggregation (for example, rollup - summarizing 
multiple rows of data - total sales for each store, and 
for each region, etc). 

• Generating surrogate-key values. 

• Transposing or pivoting (turning multiple columns 
into multiple rows or vice versa). 

• Splitting a column into multiple columns (For 
example, putting a comma- separated list specified as 
a string in one column as individual values in 
different columns). 

• Applying any form of simple or complex data 
validation. If validation fails, it may result in a full, 
partial or no rejection of the data, and thus none, 
some or all the data is handed over to the next step, 
depending on the rule design and exception handling. 
Many of the above transformations may result in 
exceptions, for example, when a code translation 
parses an unknown code in the extracted data. 



c) Load 

The load phase loads the data into the end target, usually 
the Data Warehouse (DW). Depending on the requirements of 
the organization, this process varies widely. Some data 
warehouses may overwrite existing information with 
cumulative, updated data every week, while other DW (or 
even other parts of the same DW) may add new data in a 
histories form, for example, hourly. The timing and scope to 
replace or append are strategic design choices dependent on 
the time available and the business needs. More complex 
systems can maintain a history and audit trail of all changes to 
the data loaded in the DW. As the load phase interacts with a 
database, the constraints defined in the database schema — as 
well as in triggers activated upon data load — apply (for 
example, uniqueness, referential integrity, mandatory fields), 
which also contribute to the overall data quality performance 
of the ETL process. 

II. ETL Requirements 

A practical and secure optimization of workflow in ETL 
which must satisfy the following basic requirements which can 
be explored as follows [1], [2], [3]: 
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ETL stands for extract, transform and load, the processes 
that enable companies to move data from multiple sources 
reformat and cleanse it, and load it into another database, a 
data mart or a data warehouse for analysis, or on another 
operational system to support a business process. Companies 
know they have valuable data lying around throughout their 
networks that needs to be moved from one place to another — 
such as from one business application to another or to a data 
warehouse for analysis. The only problem is that the data lies 
in all sorts of heterogeneous systems, and therefore in all sorts 
of formats. For instance, a CRM (Customer Relationship 
Management) system may define a customer in one way, 
while a back-end accounting system may define the same 
customer differently. To solve the problem, companies use 
extract, transform and load (ETL) software, which includes 
reading data from its source, cleaning it up and formatting it 
uniformly, and then writing it to the target repository to be 
exploited. The data used in ETL processes can come from any 
source: a mainframe application, an ERP application, a CRM 
tool, a flat file, an Excel spreadsheet — even a message queue. 
Extraction can be done via Java Database Connectivity, 
Microsoft Corporation's Open Database Connectivity 
technology, proprietary code or by creating flat files. After 
extraction, the data is transformed, or modified, depending on 
the specific business logic involved so that it can be sent to the 
target repository. There are a variety of ways to perform the 
transformation, and the work involved varies. The data may 
require reformatting only, but most ETL operations also 
involve cleansing the data to remove duplicates and enforce 
consistency. Part of what the software does is, examines 
individual data fields and applies rules to consistently convert 
the contents to the form required by the target repository or 
application. In addition, the ETL process could involve 
standardizing name and address fields, verifying telephone 
numbers or expanding records with additional fields 
containing demographic information or data from other 
systems. The transformation occurs when the data from each 
source is mapped, cleansed and reconciled so it all can be tied 
together, with receivables tied to invoices and so on. After 
reconciliation, the data is transported and loaded into the data 
warehouse for analysis of things such as cycle times and total 
outstanding receivables. In the past, companies that were 
doing data warehousing projects often used homegrown code 
to support ETL processes. However, even those that had done 
successful implementations found that the source data file 
formats and the validation rules applying to the data evolved, 
requiring the ETL code to be modified and maintained. And 
companies encountered problems as they added systems and 
the amount of data in them grew. Lack of scalability has been 
a serious issue with homegrown ETL software. Providers of 
packaged ETL systems include Microsoft, which offers data 
transformation services bundled with its SQL Server database. 
Oracle has embedded some ETL capabilities in its database, 
and IBM offers a DB2 Information Integrator component for 
its warehouse offerings. More than half of all development 
work for data warehousing projects is typically dedicated to 
the design and implementation of ETL processes. Poorly 
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designed ETL processes are costly to maintain, change and 
update, so it is critical to make the right choices in terms of the 
right technology and tools that will be used for developing 
logic involved so that it can be sent to the target repository. 
The basic steps used for the development of the ETL Life 
cycle are as follows: 

1 . Cycle initiation. 

2. Build reference data. 

3 . Extract (from sources) . 

4. Validate. 

5. Transform (clean, apply business rules, check for data 
integrity, create aggregates). 

6. Stage (load into staging tables, if used). 

7. Audit reports (for example, on compliance with 
business rules. Also, in case of failure, helps to 
diagnose/repair). 

8. Publish (to target tables). 

9. Archive. 

10. Clean up. 

1) Architecture of Connector 

Basic architecture of this connector will be: 

The following operations will be performed by the 
connector; 

• Reading data from source to the Quick ETL buffer. 

• Writing data to target from the Quick ETL buffer. 

• Managing the Meta data. 



Native API 



Reader 



Database 



Client GUI 



Meta Data 



— H Repository 



Native API 



Writei 



Buffer 



Quick ETL 



Figure 1 - Architecture of Connector 
Various components of the connector are: 

• Reader. 

• Writer. 

• Client GUI. 



III. 
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GENETIC ALGORITHMS 



Melanie, M., in his book "An Introduction to Genetic 
Algorithms" stated that generally speaking, genetic algorithms 
are simulations of evolution, of what kind ever. In most cases, 
however, genetic algorithms are nothing else than probabilistic 
optimization methods which are based on the principles of 
evolution. He further suggested that if there is a solvable 
problem, a definition of an appropriate programming 
language, and a sufficiently large set of representative test 
examples (correct input-output pairs), a genetic algorithm is 
able to find a program which (approximately) solves the 
problem [12]. 

Goldberg, D in his book "Genetic Algorithms in Search, 
Optimization and Machine Learning" stated that crossover 
encourages information exchange among different individuals. 
It helps the propagation of useful genes in the population and 
assembling better individuals. In a lower level evolutionary 
algorithm, the crossover is implemented as a kind of cut-and- 
swap operator[6]. 

In 2003 Ulrich Bodenhofer suggested that in solving the 
problems related to genetic algorithms the following steps can 
betaken [4]: 

Algorithm 

t:=0; 

Compute initial population BO; 

WHILE stopping condition not fulfilled DO 

BEGIN 

select individuals for reproduction; 

create offsprings by crossing individuals; 

eventually mutate some individuals; 

compute new generation 

END 

As obvious from the above algorithm, the transition from 
one generation to the next consists of following basic 
components: 

Selection: Mechanism for selecting individuals (strings) for 
reproduction according to their fitness (objective function 
value). 

Crossover: Method of merging the genetic information of 
two individuals; if the coding is chosen properly, two good 
parents produces good children. 

Mutation: In real evolution, the genetic material can by 
changed randomly by erroneous reproduction or other 
deformations of genes, for example, by gamma radiation. In 
genetic algorithms, mutation can be realized as a random 
deformation of the strings with a certain probability. The 
positive effect is preservation of genetic diversity and, as an 
effect, that local maxima can be avoided. 
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Following inferences were drawn about the Genetic 
Algorithms from the study and research carried by 
Bodenhofer: 

1 . Genetic Algorithms manipulate coded versions of the 
problem parameters instead of the parameters themselves, 
i.e. the search space is S instead of X itself. 

2. Genetic Algorithms use probabilistic transition 
operators while conventional methods for continuous 
optimization apply deterministic transition operators. 
More specifically, the way a new generation is computed 
from the actual one has some random components 

3. Normal genetic algorithms do not use any auxiliary 
information about the objective function value such as 
derivatives. Therefore, they can be applied to any kind of 
continuous or discrete optimization problem. The only 
thing to be done is to specify a meaningful decoding 
function. 

4. While almost all conventional methods search from a 
single point, Genetic Algorithms always operate on a 
whole population of points (strings). This contributes 
much to the robustness of genetic algorithms. It improves 
the chance of reaching the global optimum and vice versa, 
reduces the risk of becoming trapped in a local stationary 
point. 

Banzhaf [1999] had drawn the inference that Evolutionary 
algorithms have been shown to solve many real world 
problems. They use population based stochastic search 
strategies and are unlikely to be trapped in a poor local 
optimum. They make few assumptions about a problem 
domain yet are capable of incorporating domain knowledge in 
the design of chromosome representation and variation 
operators. They are particularly suited for large and complex 
problems where little prior knowledge is available [2] . 

The materialized view selection based on multiple query 
processing plans is a hard combinatorial optimization problem. 
Good selection of materialized views can only be found by 
taking a holistic approach and considering the optimization of 
both global processing plans and materialized view selection. 
A two- level structure for materialized view selection should be 
followed so as to get the proper result. It has facilitated greatly 
the development of several hybrid algorithms. In this literature 
survey the inference is drawn that there are several hybrid 
heuristic and evolutionary algorithms. 

Pure evolutionary algorithms were found to be impractical 
due to their excessive computation time. Pure heuristic 
algorithms were unsatisfactory in terms of the quality of the 
solutions they found. Hybrid algorithms that combine the 
advantages of heuristic and evolutionary algorithms seem to 
perform the best. It show that applying an evolutionary 
algorithm to either global processing plan optimization or 
materialized view selection for a given global processing plan 
can reduce the total query and maintenance cost significantly. 
This is further revealed from the literature survey carried out 
that simply combining or merging optimal local processing 
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plans will not produce an optimal global processing plan in 
most cases. Finding an optimal global processing plan with 
optimal materialized views requires a two level hierarchy is 
needed. While the hybrid algorithms perform better than the 
heuristic algorithm in terms of cost savings, they often require 
longer computation time. While the heuristic algorithm 
typically took seconds to run, a hybrid algorithm typically 
took minutes, or even hours to run. Finding the suitable trade- 
off between the computation time and the cost saving is the 
basic concept which we have to maximize. 



Once a data warehousing design is completed and 
implemented, it will be used frequently and may last for a long 
time. Hence it is very important to optimize the design as 
much as possible, even if this means a relatively long design 
time. For optimizing the ETL process in the data ware houses 
we have to set up the theoretical framework for the problem, 
by modeling the problem as a state space search problem, with 
each state representing a particular design of the work flow as 
a graph. Since the problem is modeled as a state space search 
problem, it is the mandatory requirement to define transitions 
from one state to another. 

IV. EFFICIENCY IMPROVEMENT IN READER 
WRITER PROBLEM 

Job of the reader is to get data from the source system and 
give it to transformation unit for processing. In the 
transformation stage this data is processed and finally written 
to the target system. 
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Figure 2 - ETL workflow execution timing. 

Typical steps to read data from Source and transfer it to 
transformation stage consist of following steps: 

• Fetch data from oracle server to local buffer. 

• Transform the data type. 

• Fill the target buffer. 

• Flush the buffer. 

• Repeat till data is available on oracle server. 
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Above mechanism to read and process data has following 
issues: 

• When data is fetched from oracle server to local 
buffer, CPU is almost idle. 

• During data transformation network bandwidth is not 
utilized. 

• During flushing of data CPU is idle. 

The network bandwidth and CPU time can be efficiently 
used using two buffers instead of one buffer. Second buffer 
can be filled while first buffer is being processed and after the 
processing of first buffer, second buffer will be ready to 
process so waiting time will reduce. The steps will be: 

• Fetch data into first buffer. 

• While transformation and flushing fetch the data from 
oracle server to second buffer. 

• Adjust the buffer size so that time to fetch is same as 
time for transformation and flushing. 

Using two buffers following scenario exist: 

• Fetch time is less than sum of transformation time 
and loading time. 

• Fetch time is greater than sum of transformation time 
and loading time. 

• Fetch time is same as sum of transformation time and 
loading time. 

For the optimal utilization of network and CPU time, Fetch 
time should be same as sum of transformation time and 
loading time. 

Case 1 : T F < T T + Tu 
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• Update the time TF1 used to fill buffer. 
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Figure 3 - Scenario for Two Buffer Implementation of 
Reader 

To make the fetch time same as transformation time + 
loading time, the buffer size is changed dynamically. In the 
proposed algorithm two threads are used, one for fetching and 
other for transformation and loading. Both will work as: 

Thread to fetch data: 

• Check the status of buffer 1 . 

• If empty fetch data in buffer 1 and change 
the status to filled. 



• If no more data available stop. 

• Check the status of buffer2. 

• If empty fetch data in buffer2 and change 
the status to filled. 

• Update the time TF2 used to fill buffer. 

• If no more data available stop. 
Thread for transformation and uploading: 

• Check the status of buffer 1. 

• If filled then transform data and flush it. 

• Change the status to empty. 

• Note the time for transformation and 
flushing. TP1=TT1+TU1. 

• If TF1 < TP1 increase the buffer size. 

• Check the status of buffer2. 

• If filled then transform data and flush it. 

• Change the status to empty. 

• Note the time for transformation and 
flushing. TP2 = TT2 + TU2. 

• If TF2 > TP2 decrease the buffer size. 
Adjusting buffer size: 

• Buffer Size: S 

• IF abs (TF - TP) > 2 // 
Minimum Time Difference 

IF TF > TP 
dS= S * TF / TP * Rnd // Random Value 
between & 1 
New Buffer Size = S - OS 

IF TF < TP 
dS= S * TP / TF * Rnd 
New Buffer Size = S + OS 

V. WORKFLOW OPTIMIZATION 

To implement the ETL workflow as state- space search 
problem the workflow is represented by a directed acyclic 
graph, where each node represents a transformation and edge 
represents the workflow. Consider the example: Two different 
databases contain the purchase information from different 
vendors. The database contains date, amount and vendor id. 
Now both the data need to be merged to single database so the 
possible workflow can be as in Figure 4. Where S is source 
node, SK is surrogate key assignment, U is union, SL is 
selection transformation and T is target. 
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Figure 4- Initial Workflow 

Solution Encoding 

The solution for the current problem is encoded as an array 
of nodes. Each node represents a transformation. For example 
(1,2,6,7,3,4,5). 

Initial Population 

Initial population is generated by random transformation on 
the given workflow. 

Fitness Function 

The objective function of the individual is based on the total 
cost to execute the workflow. The cost of any transformation 
is defined by the cost model chosen. The problem is to 
minimize the cost, so an fitness function is derived which 
needs to be maximized. 

Operators Selection 

The tournament selection is used to generate new 
population. 

Crossover 

Single-point cross over is implemented to test the results. In 
the single-point crossover a random node is chosen and the 
parents are cut at crossover points and then join to produce the 
new individuals. 

For example, consider following two individuals: 



> 

L 




SL^^^ 




SK"^ 


/ 






>^ 




4 .^-^ 


U 
3 
















r 


r 
s 








> 

5 




SK"^_ 




SL\. 






>^ 




4 ~***^ 







Figure 5 - Selected first individual 



Figure 6 - Selected second individual 
After crossover new individual will be 
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Figure 7- First individual after crossover 
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Figure 8 - Second individual after crossover 
Mutation 
Mutation with probability 1/3 is used. Mutation is applied 

• Generate a random number. 

• If random number is greater than mutation probability 
Exit. 

• Find the list of transformations applicable to the 
node. 

• Select a random transformation. 

• Apply the transformation to the node. 

In the experiment following transformation are supported: 

• Swap. 

• Factorize. 
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Distribute. 
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Crossover Simple 



The algorithm used to generate the optimal workflow is as 
follows: 

Initialize population from the given workflow by random 
mutation; 

Do 

Generate the Cost of each workflow 

Make a log of lowest cost workflows 

Determine which individuals should survive with 
fitness function; 

Reproduce the survivors; 

Select parents randomly from the survivors for 
crossover; 

Select the crossover sites of the parents; 

Produce the next new generation of workflows; 

Mutate the new generation of workflows according to 
the mutation probability; 

If iteration limit exceeded 

output the optimal solutions 

exit 

endif 

loop 

A) Generating Initial Population 

Initial population is generated from the given ETL 
workflow by randomly applying transitions on the workflow. 
The pseudo code for the initial population generation is: 

While not popsize 

Copy the given workflow to create a new individual. 

Select a random node. 

Select a random transition. 

Apply the transition to the selected node. 

But during this processing network and CPU are not fully 
utilized. System can be made more efficient using more than 
one buffer as during processing time data can be fetched into 
another buffer. Workflow optimization is formulated as state- 
space search problem and state-space search is implemented 
using genetic algorithm. 

B) Workflow Optimization 

For the workflow following experiments have been 
conducted: 



Selection Tournament Selection 
Crossover Probability 0.1 
Mutation Probability 0.9 

Below is the average of 100 Runs for 10 generations. 
Table 1 : Generation No Vs Average Fitness 



Generation No. 


Average Fitness 





396.9521 


1 


402.2411 


2 


408.1349 


3 


414.5278 


4 


420.2802 


5 


425.2696 


6 


430.4964 


7 


434.1143 


8 


440.2575 


9 


444.36096 
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Experiment 1 : 



Initial Population 200 



Figure 9 - Average Fitness Vs Generation No 
Experiment 2: 

Initial Population 200 
Crossover Simple 

Selection Tournament Selection 
Crossover Probability 0.9 
Mutation Probability 0.1 
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Below is the average of 100 Runs for 10 generations. 
Table 2: Generation No Vs Average Fitness 



Generation No. 


Average Fitness 





397.9563 


1 


403.0553 


2 


409.2708 


3 


416.0089 


4 


420.0008 
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426.4026 
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431.6523 
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437.5481 


8 


443.3943 


9 


447.2889 



Figure 10 - Average Fitness Vs Generation No 
Comparative Study 
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Figure 11- Average Fitness Vs Generation No for Different 
crossover probabilities 

Experiment 3: 

Random Initial Solution 200 
Below is the average of 100 Runs for 10 generations. 

Table 3: Generation No Vs Average Fitness 



Generation No. 


Average Fitness 





397.9763 


1 


403.0553 


2 


397.9763 


3 


416.0089 


4 


397.9763 


5 


397.9763 


6 


431.6523 


7 


397.9763 


8 


416.0089 


9 


403.0553 
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Figure 12 - Average Fitness Vs Generation No for Random 
Workflow generation 

Comparison ofGA Vs Random 
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Figure 13: Comparison of Random workflow Vs GA 

Results: 

This is clear from the above graph that as the generation 
goes on increases the average value of the fitness goes on 
increasing. 



Limitations of The System 

This approach is used to generate the optimal workflow as a 
search space solution. This technique does not consider the 
analytical analysis of the problem. Fitness is proportional to 
the total cost but does not include analysis of the relations 
between operators. From the results it is clear that genetic 
algorithm outperforms random workflow optimization. For the 
workflow optimization this study has used only swap, 
factorize and distribute transitions. It is further observed that 
as the generation in the genetic algorithm goes on increasing 
then accordingly the optimization of the ETL system 
increases. However the above conclusions only relate to the 
comparison of the random numbers and genetic algorithms. 
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There is a need to further investigate all the possible 
transitions, and their applicability conditions. It has not be 
derived that up to what stage the optimization will keep on 
increasing monotonically using the genetic algorithms or it 
will decrease after some time depending upon the complexity 
of the data warehouse. This can be the topic of further research 
that upto which stage the optimitality of the ETL system 
increases with the use of genetic algorithms. 

VI. CONCLUSION 

Good selection of materialized views can only be found by 
taking a holistic approach and considering the optimization of 
both global processing plans and materialized view selection. 
Pure evolutionary algorithms were found to be impractical due 
to their excessive computation time. Pure heuristic algorithms 
were unsatisfactory in terms of the quality of the solutions 
they found. Hybrid algorithms that combine the advantages of 
heuristic and evolutionary algorithms seem to perform the 
best. 

For optimizing the ETL process in the data ware houses we 
have to set up the theoretical framework for the problem, by 
modeling the problem as a state space search problem, with 
each state representing a particular design of the work flow as 
a graph. Since the problem is modeled as a state space search 
problem, it is the mandatory requirement to define transitions 
from one state to another. For the workflow optimization this 
study has used only swap, factorize and distribute transitions. 
It is concluded that the results generated by optimizing the 
data of data warehouse using random numbers are less optimal 
as compared to when one uses the genetic algorithms. It is 
further observed that as the generation in the genetic algorithm 
goes on increasing then accordingly the optimization of the 
ETL system increases. However the above conclusions only 
relate to the comparison of the random numbers and genetic 
algorithms. 

The efficiency of the reader is improved by the use of two 
buffers. The buffer size depends on the factors like network 
bandwidth, memory size, CPU speed etc. There is a need to 
further investigate the relation between these parameters and 
optimal number of buffers which can improve the 
performance. For the optimization of workflow the new 
workflows have been generated with the implementation of 
genetic algorithms. The cost of newly generated workflows 
produced by genetic algorithms is less than those produced by 
random way because the genetic algorithm explores the search 
space very fast with respect to the desired objective function. 

VII. FUTURE SCOPE 

There is a need to further investigate all the possible 
transitions, and their applicability conditions. It has not be 
derived that up to what stage the optimization will keep on 
increasing monotonically using the genetic algorithms or it 
will decrease after some time depending upon the complexity 
of the Data Warehouses (DW). This can be the topic of further 
research that up to which stage the optimitality of the ETL 
system increases with the use of genetic algorithms. 
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Abstract — The speed of the daily growth of computational 
biology databases opens the door for researchers in this field of 
study. Although much work have been done in this field, the 
results and performance are still imperfect due to insufficient 
review of the current methods. Here in this paper we discuss the 
common and most popular methods in the field of 3D protein 
structure comparison and retrieval. Also, we discuss the 
representation methods that have been used to support similarity 
process in order to get better results. The most important 
challenge related to the study of protein structure is to identify its 
function and chemical properties. At this point, the main factor 
in determining the chemical properties and the function of 
protein is the three dimensional structure of the protein. In other 
words, we cannot identify the function of a protein unless we 
represent it in its three dimensional structure. Hence, many 
methods were proposed for protein 3D structure representation, 
comparison, and retrieval. This paper summarizes the challenges, 
advantages and disadvantages of the current methods. 

KeywordsSD protein structure; protein structure retrieval; 
protein structure comparison; PDB; 

I. Introduction 

Bioinformatics, considered a bridge connecting biology and 
computer science, is increasingly attracting the interest of 
researchers day by day. The size of protein, DNA and RNA 
databases is growing rapidly and as such necessitates the need 
for faster and efficient methods to manage and retrieve these 
data, expasy and rcsb [2, 3] are examples of protein databases 
websites which show the amount of the database growth every 
year. The goals of bioinformatics are to help biologists in 
collecting, managing, processing, storing, analyzing and 
retrieving genomic information that the biologists have and 
need [4]. One of the most interesting fields in bioinformatics is 
proteins where many researches focus on protein analyzing, 
predicting, comparison and similarity, retrieving, representation 
and more. The most important parts of proteins are its function 
and chemical properties which are determined at the 3D protein 
structure level [5]. So it is important to manage the data 
analyses, predict and retrieve the tertiary protein structure. 
Many researches have been carried out for this purpose. 
Furthermore, many databases (repositories) of protein 



structures are built to serve researchers in this field. One of the 
most common and essential protein structure repositories is the 
Protein Data Bank (PDB) [3, 6]. 

Most of the 3D protein structures in the database are 
determined using X-Ray crystallography methods and NMR 
[7]. These two methods are accurate but they are too slow and 
also too expensive. The first crystal 3D structure of protein 
myoglobin was determined and solved in 1958 [8]. 

Protein 3D structure similarity and retrieval importance 
increases day by day in tandem with the information that 
protein structures can provide and tell. Many methods have 
been proposed for protein structure representation, similarity 
and retrieval, but unfortunately the accuracy of the retrieval 
methods are still unsatisfactory. Those methods vary in 
techniques used in representation and comparison. 

The benchmarks for the similarity and retrieval of the 
protein structures are the time of retrieving back a structure 
from the database and the accuracy of the retrieved structures. 
Most of the researches that have been done in solving this 
challenge consider DALI, VAST, computational extension 
(CE) and SCOP as a performance metrics. 

1) PDB 
Created and started in 1971 as an archive library for 
biological structures of macromolecules at Brookhaven 
National Laboratories [3, 6, 9]. The focus on PDB comes due 
to its importance and services in this domain where it is the 
most common database as well as it is considered as a primary 
database of protein 3D structures. The structures in PDB are 
obtained by two famous methods, X-Ray Crystallography and 
NMR [7], where they have been carefully validated. Figure 1 
shows an example of the PDB database format. 
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COMPND MOL_ID: 1; 

COMPND 2 MOLECULE: GLUTATHIONE SYNTHETASE; 

COMPND 3 CHAIN: A; 



SOURCE MOL_ID: 1; 
SOURCE 2 ORGANISM_SCIENTIFIC: AVIAN SARCOMA VIRUS; 



REMARK 3 REFINEMENT. 
REMARK 3 PROGRAM : X-PLOR 3.851 



Figure 1: PDB File Format Example [1] 

Besides PDB database there are many databases that serve 
the 3D protein structures domain. The Structural Classification 
of Proteins, SCOP, is a protein structure database which 
describes the known evolutionary relationship of the protein 
structures as well as its structural relationship. It has been held 
at Cambridge University in the medical research council [10, 
11]. The classification of protein Class, Architecture, 
Topology, and Homologous superfamily, CATH, [12] is a 
database that classifies the structures in the PDB hierarchically, 
and held at University of London. Furthermore, the Families of 
Structurally Similar Proteins (FSSP) database, built at the 
European Bioinformatics Institute, was created based on the 
DALI method [13, 14]. Moreover, a database called PROSITE 
[2, 15] is for the family classification of proteins where protein 
structures are classified into families that share the same 
functions. 

2) Protein Structures 

Proteins are the basic component of human cells as well as 
being the largest. So, the importance of proteins is clear 
regarding the role that proteins play in determining the function 
of cells. Proteins have many structures where each structure 
helps in the understanding the functions and chemical 
properties of living cells. The functions and chemical 
properties of proteins cannot be identified or determined before 
forming its tertiary structure. [16] shows the four levels of 
proteins starting from the amino acid sequences ending with its 
quaternary structure . 

The trusted methods for identifying the tertiary structure of 
proteins are X-Ray Crystallography and NMR [7]. But the 
problems with those methods are cost and time where they are 
expensive and much time is massively consumed in order to 
form the tertiary structure. 

3) Retrieval process 

Searching for similar protein structures from the target 
database goes through many processes. First, the protein gets 
represented in a proper way that is suitable for comparison 
methods. This transformation of the protein has to be done for 
both the query protein structure and the database. This process 
is considered as a pre-process due to the size of the database 
and the time consumed by this stip. The rest of the sub- 
processes are all about how to get and measure the similarity 
and search for the query protein structure. 
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4) Problem Domain 

Protein structure comparison and retrieval is one of the 
most important challenges in bioinformatics. Researchers' 
outputs in this field are still unsatisfactory where performance 
is less than the expected for time and accuracy. An advantage 
of protein structure retrieval is that it helps in predicting the 
tertiary structure of proteins and thus plays an important role in 
understanding and identifying the functions of protein. 

The challenges in this domain are accuracy and time where 
faster and high accuracy methods are required without 
sacrificing the time. Many methods have been produced in this 
research area to find out the optimal solution for solving this 
challenge. 



II. MATERIALS AND METHODS 

A. Similarity Representaion Methods 

Similarity representation of protein structure importance 
comes about due to its role in understanding the behavior of 
proteins. It helps in protein structure matching and similarity 
among other protein structures. Furthermore, it is the first step 
of protein structure comparison and retrieval. It is the process 
where the protein structure is built and rearranged in order to 
give simple and efficient representation for protein comparison 
to manage and efficiently prepare the matching. This data 
forming helps in fastening the comparison and retrieval process 
of proteins and has a high effect on the accuracy. 

Many methods have been proposed for protein 3D structure 
similarity representation in order to enhance the comparisons of 
performance and efficiency. The following sections present 
these methods. 

1) Matrix representation methods 
This group uses matrices for presenting protein 3D 
structures. These methods are divided into two sub-groups, 
distance and similarity matrices. 

a) Distance matrix: Two proteins are aligned in a matrix 
alike in order to represent them by calculating the distance 
between them. The values contained in the cells of the matrix 
represent the distance between the amino acids of the two 
proteins. 

Holm L. and Sander C. [17] proposed an algorithm for 
protein structures comparison called DALI. The protein 
structures were represented as a distance matrix. The alignment 
between patterns and protein structures is done by executing a 
pairwise comparison on the distance matrices' patterns, where 
the similar patterns are kept in a list called pair list. Then, the 
patterns in the pair list are gathered to be aligned into a large 
set of pairs. The algorithm focuses on the subset of the patterns 
because of the size of the distance matrix, where it increases by 
increasing the length of the patterns or protein structures,. The 
distance matrix is reduced and the similar patterns are limited, 
in order to decrease the scope of the research process. 

Aung Z. and Tan K.L [18] proposed a protein 3D structure 
retrieval system called PROTDEX2. The algorithm depends on 
index construction to represent the protein structure which is 
divided into two sub-processes, feature vectors extraction from 
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the contact regions (inter- SSE relationship) and constructing 
the inverted file index. The feature vectors are represented 
using distance matrix representation and SSEs (Secondary 
Structure Elements) vector representation. Each cell of the 
distance matrix contains the distance between the two Ca 
atoms, where the distance is calculated using Euclidean 
distance. To calculate the SSE vectors' start and end points, 
equations adopted from [19] were used. Also STRIDE 
algorithm [20] has been used to identify the SSEs. 

To construct the invert file index a 7 -dimensional feature 
vectors and hash table were generated. Generating the 7- 
dimensional feature vectors is done first before these vectors 
are hashed into 7 -dimensional hash table. Then, based on the 
generated hash table the inverted file index is built. 

Masolo K. and Ramamohanarao K. [21] proposed a method 
for protein structure representation in order to accelerate the 
protein structure retrieval process. This method is based on 
constructing the protein feature vectors by using wavelet 
techniques. The idea behind using wavelet is its ability of 
compressing the application without sacrificing any of the 
details [22]. The earlier step of this method is building the 
distance matrix in order to build the feature vectors. The 
distance matrix is built by using the pairwise distance in the Ca 
atoms level. To construct the representation of global structure 
they implemented the 2D decomposition of wavelet. Then the 
estimated coefficients are extracted from the upper part and the 
diagonal of the distance matrix. 

b) Similarity matrix: [23] Similar to the distance matrix 
the two proteins are aligned in a matrix, but the values in the 
cells of the matrix will present similarity values between amino 
acids. 

Shindyalov I. N. and Bourne P.E [24] proposed an 
algorithm to enhance the protein structure's similarity and 
retrieval process. The first step in this algorithm is defining the 
alignment path. The alignment path can be defined as the 
longest continuous path in the similarity matrix by aligning two 
protein structures. Also, the algorithm took into consideration 
the alignment gaps, where it has conditions to control that 
which the two AFPs (Alignment Fragment Pairs) are aligned 
without gaps or one of the two proteins has gaps. 

Chen S. C. and Chen T [25] proposed a protein structure 
retrieval method based on geometric hashing algorithm [26]. 
The pre-process for this algorithm is proteins feature extraction 
in order to get a new representation for the protein. For 
sequence alignment they adopted a similarity matrix called 
Dayhoff PAM250 [27] to enhance the performance of the 
algorithm. 

2) Graph representation methods 
Graph representation is one of the ways for protein 3D 
structure representation which is used to enhance the 
comparison and retrieval process for the protein structures. 

Chen S. C. and Chen T [28] proposed a new algorithm for 
protein structure similarity and retrieval based on geometrical 
features. The algorithm represents the protein structure 
depending on the spatial relationship. It looks for the best 
alignment of the proteins first, and then it extracts the 
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geometric feature of the protein in order to define its geometric 

features. 



Daras P. et al [29] proposed a three-dimensional shape 
structure comparison method for protein structure classification 
and retrieval. Protein structure representation in this algorithm 
is done by building a sphere and then triangulating it by using 
techniques of 3D modeling. By representing the protein as 
spheres, the number of connections and vertices will be 
reduced. Also in this step a new center of protein mass is 
calculated to be at the origin. 

Sael L. et al. [30] introduced a novel algorithm for protein 
structure comparison and retrieval using 3D Zernike 
descriptors. Constructing the surface of the protein structures 
and detecting the surface area of the 3D structures are the 
initial steps of calculating the 3D Zernike descriptors. To build 
the protein surface, the algorithm first determines the surface 
area in the space of the structure. Furthermore, to calculate the 
Connolly surface (Triangle mesh) the algorithm depends on an 
existing program called MSROLL [31]. Then, the triangle 
mesh is arranged in the grid in a way that fits the protein in the 
grid. 

B. Conventional Methods 

The first step of protein structure retrieval from the Protein 
Data Bank (PDB) is protein structure comparison. If two 
proteins have the same structure, this implies that they might 
have the same function. Finding a protein similarity in the PDB 
helps the biologists to discover new functions for the proteins 
and also it helps in identifying unknown proteins functions. 
Many methods have been proposed in order to find the 
similarities between proteins. In this section we are going to 
present a preliminary study of the existing classified methods 
regarding their approaches. 

1) Shape-Base approach 
Sael L. et al. [30] introduced a novel algorithm for protein 
structure comparison and retrieval using 3D Zernike 
descriptors. 3D Zernike descriptors are used to help build the 
protein structure surface which provides a simpler 
representation of protein structures. Furthermore, this new 
representation helps to increase the speed of the comparison 
process. As a result of this research, searching for a protein in a 
database that consists of a few thousand protein structures takes 
less than a minute. The accuracy of the algorithm was 89.6% as 
compared with a well known algorithm in the domain called 
combinatorial extension (CE) [24] . 



3D Zernike descriptor 
but not on the main chain 
in the search results. This 
similar surface shape but 
chain with different 
recommended that this al| 
process for the protein 
methods. 



focuses on the surface of the protein 
which in some cases results in errors 
fault because of some structures has a 
different main chain or similar main 
surface shape. Therefore, it is 
*orithm is used as a primary filter pre- 
structure comparison and retrieval 



Chen S. C. and Chen T. [25] proposed a protein structure 
retrieval method based on geometric hashing algorithm [26]. 
The idea of using the geometric hashing method is to find alike 
binding sites by applying surface matching on the protein. In 
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addition, a-hull and 3D reference frames techniques were 
adopted to simplify the algorithm complex computations. 

As compared with [25], this algorithm obtained more 
accurate results. The experiments show that going through 
substructures for matching protein structures gives better 
results than matching between the whole protein as one block. 



Daras P. et al. [32] proposed a three-dimensional shaped 
structure comparison method for protein structure classification 
and retrieval. Basically, the method depends on the geometry 
of the protein structures in the first stage, and in the second 
stage it depends on the primary and secondary structure of 
proteins. Next, when the 3D structure of the protein is built 
properly, the Spherical trace transform from [33] is used to 
build the geometry-based descriptor vectors. To evaluate the 
accuracy of the classification produced by using this method, 
FSSP/DALI database is used. 

The accuracy of classification is calculated by the number 
of correctly predicted proteins over the real number of proteins 
in the database. The results of this method are compared with 
[34] to compare the accuracy of the classification and [25] to 
compare the retrieval performance by considering them close to 
the scope of the work. Regarding its accuracy, the proposed 
method outperformed the performance of [34] for a single 
domain chain. But in multiple domain proteins, the results were 
unconvincing. The performance of the proposed method shows 
some improvement. 

Zhou Y. et al. [35] proposed a method that depends on 
using one of the shape distribution methods. It depends on the 
distribution of the backbone's Ca representation. The method is 
proposed to measure the structural similarity between two 
proteins. It gives a strong shape distribution similarity among 
proteins with close structure and functions. The use of shape 
distribution is to calculate the distance between the atoms (Ca 
atoms). After studying the shape functions they found that the 
most suitable shape function is D2 where it is easy to 
understand, faster, and robust. 

2) Heuristic approach 

Holm L. and Sander C. [17] proposed an algorithm for 
protein structures comparison called DALI. DALI algorithm is 
considered as a benchmark in the protein structure retrieval 
domain since many algorithms compare their results with it, in 
order to evaluate their results. The algorithm is looking for the 
best pairwise alignment [36-38] of proteins structures. To 
obtain the similarity among protein structures, the DALI 
algorithm looks for the occurrence of the query pattern in the 
protein structures database, then it identifies the largest 
common sequence among them. It gives high accurate and 
sensitive results regarding its search for similar structures of 
the two proteins, furthermore it does not get affected by the 
geometrical noise. 

The DALI method shows high robustness and it is a general 
method which means it can be considered for solving other 
problems in the field. However, this method sacrifices the time 
in order to gain more accurate results with high sensitivity, 
where the searching process for one structure against the 
database takes almost a night. 
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Another well-known algorithm in this field is called CE 

(Combinatorial Extension) proposed by Shindyalov I. N. and 

Bourne P.E. [24] to enhance the protein structure similarity and 

retrieval process. CE algorithm is one of the best algorithms of 

protein retrieval regarding its accuracy of retrieving structures, 

but it saves time. Furthermore, the CE algorithm provides a 

good resolution of search in terms of finding protein structural 

similarity. Most of the methods in this field use CE algorithm 

to evaluate their results by comparing with CE results. The 

evaluation is in terms of accuracy and time of retrieving protein 

structures. 



Aung Z. and Tan K.L. [18] proposed a protein 3D structure 
retrieval system called PROTDEX2. They came out with a 
system by adopting methods and techniques used successfully 
in information retrieval (IR) systems. The main objective 
behind this adaptation was to increase the speed of the 
searching process via protein database. The new system shows 
noticeable improvement in both sides, time and accuracy 
compared with PROTDEX [39] which is a previous version of 
PROTDEX2. 

Speeding up the retrieving time is at the expense of accuracy. 
The results when compared with popular existing systems in 
this field which are DALI, CE, TOPSCAN [40] and the 
previous version of this work PROTDEX, PROTDEX2 
produce the best retrieving time but not optimal results. 

Chen S. C. and Chen T., [25] proposed an algorithm for 
protein structure similarity and retrieval based on geometrical 
features. Therefore, by identifying the geometrical features 
they avoid dealing with the chemical characteristics of the 
protein as well as the biological properties. To find the 
similarity among protein structures the algorithm focuses on 
getting the similarity of the appearance and spatial relationship 
of two sets of points. To speed up the process of comparison 
they identify the geometry features of the protein. Also, before 
extracting the features the algorithm searches for best 
alignment between the proteins. The new algorithm was 
compared with DALI and it was close in performance to DALI 
with more simplicity and efficiency. 

III. Conclusion 

In general, methods of 3D protein structure comparison and 
retrieval comes under one of two well known approaches 
which are heuristic and shape-base. Methods that follow the 
heuristic approach in general are fast. But on the other hand, 
they still have heuristic weaknesses as they sacrifice on 
accuracy. Shape-based methods focus on the robustness of the 
algorithm in order to reach higher accuracy and sensitivity with 
a reasonable running time. Anyhow, research in 3D protein 
structure comparison and retrieval field is still open while the 
current researches' results are still unconvincing. So, new 
innovative methods are needed to gain high performance in 
both accuracy and speed up. 
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Abstract — Ad-hoc networks are characterized by 
multihop wireless connectivity, frequently changing 
network topology and the need for efficient dynamic 
routing protocols plays an important role. Due to 
mobility in Ad-hoc network, the topology of the 
network may change rapidly. The mobility models 
represent the moving behavior of each mobile node in 
the MANET that should be realistic. This paper 
concerns performance of mobile Ad-hoc network 
(MANET) routing protocol with respect to the effects 
of mobility model on the performance of DSR 
protocol for the purpose of finding the optimal 
settings of node speed. In this paper, we evaluate the 
performance of DSR protocol using Random 
Waypoint Mobility Model in terms of node speed, 
number of connections, and number of nodes. 



Reactive Protocols: In this type of protocols such as 
DSR, AODV, the routes are created when it's required to 
send data packets from the source to the destination [1]. 
We have determined the impact of four factors on the 
performance of DSR by using Random waypoint 
mobility model in our previous paper [2] in press. These 
factors pause time, network size, number of traffic 
sources and routing protocol. We examine the impact of 
these factors on four performance metrics: packet 
delivery ratio, average end-to-end delay, normalized 
routing load and protocol overhead, hi this paper, we use 
Random waypoint as mobility model on DSR protocol to 
study the effect of node speed with other factors in order 
to find the optimal setting for the node speed parameter 
with different scenarios. For this performance study, we 
use Network Simulator 2 (ns-2) version 2.34. 



Keywords-MANET, Mobility Models, Routing Protocol, 
DSR Protocol. 



I. 



Introduction 



With existing advances in technology, wireless 
networks are growing in popularity. Wireless networks 
permit users the freedom to move from one position to 
another without break of their computing services. Ad- 
hoc networks is one of the subset of wireless network 
that dynamically forming a temporary network without 
using any existing network infrastructure or centralized 
administration. A major problem in ad hoc network is 
how to send data packets among mobile nodes efficiently 
without fixed topology or centralized control, which is 
the most important goal of ad hoc routing protocols. 
Therefore, it is necessary a high-quality routing protocol 
in order to establish the link between the nodes, since the 
mobile node can vary their topology regularly. In ad-hoc 
network the routing protocol is one of the important 
issue and most challenging research area, since mobile 
ad -hoc network vary their topology frequently. 
Generally, the major task of routing in a network is to 
detect and keep the best path to send data packets 
between source and target through intermediate nodes. 
There are two categories of routing protocols in ad hoc 
networks: Protocols: In this type of protocols such as 
DSDV, OLSR, consistent and up to-date routing 
information to all nodes is maintain at each node. ii. 



II. Dynamic source routing ( DSR ) 

DSR is reactive and efficient protocol. It determines 
the correct path only when a packet wants to be 
forwarded. The node broadcast the network with a route 
request and builds the essential path from the responses 
it receives. DSR allows the network to be fully self 
configuring with no need for any existing network 
infrastructure or administration. The DSR protocol is 
composed of two main mechanisms that work together to 
allow the discovery and maintenance of source routes in 
the ad-hoc network. All aspects of protocol operate 
entirely on demand allowing routing packet overhead of 
DSR to scale up automatically [3] [4]. 

Route Discovery: The example for route discovery 
shown in Figure 1. When a source node 1 wants to send 
data packets to the destination node 8, node 1 will 
broadcast Route Request Packet (RREQ) to all the 
neighbor nodes 2, 3, 4. After intermediate nodes receive 
theses packets will rebroadcast these packets to the 
destination if there is no route in the route cache. When 
the destination node 8 will receive RREQ, node 8 will 
inform the source node 1 by sending the Route Reply 
Packet (RREP). The source node will start sending the 
data packets to the destination through the intermediate 
nodes. This process mechanism called route discovery. 
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Route Maintenance: This mechanism contains two 
packets; Route Error Packet (RERR) and ACKs packets. 
Route error packet generated when there is changing in 
the network or node out of the transmission range which 
causes link failure. Intermediate node will send RERR 
to the source node. Source will check if there is route in 
the route cache to send the packets to the destination, if 
there is no alternative route. Source node will reinitiate 
route discovery process again. These processes will take 
long delay to re-establish again in order to send the data 
packets to the destination [5]. 




Fig. 1 . DSR Route Discovery 

III. Random way poitn model 

The Random waypoint model is widely and simply 
used to evaluate the performance of ad hoc routing 
protocols. The implementation of this model in the 
network simulator (ns-2) is as follows: each mobile node 
arbitrarily selects one position in the simulation field as 
the target, then moves towards this target with fix 
velocity selected uniformly and randomly from [0, 
Vmax], where the parameter Vmax is the maximum 
velocity for each mobile node [6] . The velocity and path 
of the nodes are selected separately from of other nodes. 
When will reaching the target, the node stops for a 
period of time defined by the 'pause time'. 

hi the Random waypoint model, velocity and pause 
time are two key parameters that determine the mobility 
performance of nodes. When the pause time is long and 
velocity is small the topology of ad-hoc network 
becomes stable. On the other hand, when the mobile 
node moves fast and the pause time is small; the 
topology is likely to be highly dynamic. Random 
waypoint model can create different mobility scenarios 
with different levels of node speed. In Figure 2, shows 
that node movement in the Random waypoint. 




Fig. 2. Traveling pattern of an MN using the Random Waypoint 
Mobility Model 



IV. RELATED WORK 

In the recent years, several works has been done by 
Perkins, Hughes and Owen [7] shows that some 
parameters such as pause time, node speed, increasing 
the number of nodes, and increase the number of 
sources can have an effect on the routing protocols 
performance. In their work Random waypoint model 
has been used, but employed Global Mobile System 
Simulator (GloMoSim) rather than ns-2. 

Azzedine Baoukerche has comparing four routing 
protocols such as (AODV, PAODV, DSR, and CBRP) 
[8]. The simulation parameters were tested in his paper 
with maximum number of nodes 25 and low traffic with 
maximum speed 20 m/s. However, our work tested with 
various numbers of nodes (10, 20, 40, and 80) with 
different source connections (4, 8, 30, and 40) which 
can make high traffic and various speed 20, 40, 60, and 
80. 

Yogesh, Yudhvir, and Manish, they have compared 
and analysis two reactive routing protocols such as 
AODV, DSR [9]. The main objective in their paper to 
evaluate the performance of these two protocols based 
on the packet delivery fraction, end -to-end delay, and 
normalized routing load. The simulation parameters 
were tested increasing number on nodes and various 
pause times with fixed maximum speed (0-25 m/s 
only). While our work with various maximum speeds in 
order to select the optimal setting for maximum speed. 



V. Simulation Setup 

The MANET network simulations are implemented 
using Random waypoint model which can generate by 
using movement tool (setdest) in ns-2 simulator. The 
simulation period for each scenario is 200 seconds and 
the simulated mobility network area is 1000 m x 500 m 
rectangle. Simulation runs are made with the number of 
random traffic Constant Bit Rate (CBR) which can 
generate by using (cbgen.tcl). Figure 2, it shows that the 
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simulation methodology for our implementation. The 
rest of simulation parameters shown in the Table below. 

TABLE I. SIMULATION PARAMETERS 





Parameters 


Value 


Simulation Time 


200 s 


No. of Nodes 


10, 20, 40, 80 


No. of connections 


4, 8, 30, 40 


Pause Time 


40 s 


Simulation Area 


1000 x500 m 


Traffic Type 


Constant Bit Rate (CBR) 


Maximum Speed 


20 ,40,60,80 m/s 


Mobility Model 


Random Way point 


Routing Protocol 


DSR 


MAC Type 


802.11 
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Fig. 3. Simulation Methodology 

VI. PERFORMANCE METRICS 

We have consider packet delivery ratio, end to end delay, 
protocol control overhead and normalized routing load 
as a metrics during our simulation in order to evaluate 
the performance of the DSR protocol. 

Packet Delivery Fractions (PDF): the packet delivery 
ratio is calculated by dividing the number of packets 
received by the destination through the number of 
originates packets by the application layer of the initiator. 
PDF is specifying the loss packets rate, which limits the 
maximum throughput over the entire network. The more 
complete and correct routing protocol, the better packet 
delivery ratio. 



PDF 



Number of received packets 
Number of sent packets 



Average end to end delay: this is defined as the average 
delay in transmission of a packet between two nodes and 
is calculated as follows: 



AED = 



Time packet received/ - time packet sent/ 
=o Total number of packets received 



Equation. 2 AED 

A higher value of end-to-end delay means that the 
network is congested and hence the routing protocol 
does not perform well. The upper bound on the values of 
end-to-end delay is determined by the application. 

Protocol Control Overhead: This is the ratio of the 
number of protocol control packets transmitted to the 
number of data packets received. 

Normalized routing load: this is calculated as the ratio 
between the numbers of routing packets transmitted to 
the number of packets actually received (thus accounting 
for any dropped packets): 



NRL 



Number of routing packets sent 
Number of data packets received 



Equation. 1 PDF 



Equation. 3 NRL 
This metric gives an analysis of routing protocol 
efficiency, since the number of routing packets sent per 
data packet gives an idea of how well the protocol 
maintains the routing information updated. The lower 
of NRL, the lower the overhead of routing protocol and 
consequently the higher the efficiency of the protocol. 

VII. Results and discussion 

hi this section, details of the simulation results in term 
of packet-delivery fraction, average end to end delay, 
protocol overhead, and normalized routing load. All the 
results were obtained by averaging 5 times over the 
simulation for every scenario in order to select the 
optimal setting for the maximum speed. 

i. Packet Delivery Fraction (PDF) 

This metric with high packet delivery ratio, routing 
protocol will be more efficient. Figure 4, shows that 
packed delivery ratio is decreased in first scenario 
whenever increasing node speed. As above stated, it 
has been taken four main scenarios and each of this 
main contains four sub-scenarios which means 4x4 = 
16 scenarios have been taken in this experiment. In 
each of these experiments of sub- scenarios, the node 
speed increases while other parameters are constant. In 
Figure 4, shows that the optimal setting is 20 speeds 
among all of the four scenarios in term of packet 
delivery ratio. 
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Hi. Routing Protocol Overhead 

hi Figure 6, it shows the overhead result which 
generated by the routing protocols to achieve this level 
of data packet delivery. Figure 6, shows that overhead is 
direct proportional to the number of sending packets, in 
the first scenario with low mobility overhead increased 
whenever node speed increased. With normal speed 20 
m/s in first scenario overhead decreased. The rest of 
scenarios with high mobility overhead is expect to 
increase and decrease because there are more 
destinations to which the network must maintain 
working routes. 



Fig.4. Speed vs. Packet Delivery Fraction 

ii. Average End-to-End Delay 

The average end-to-end delay is affected when the 
traffic Constant Bit Rate (CBR) is high rate of 
packets as well. The buffers become filled much 
faster, so the packets have to wait in the buffers a 
much longer period of time before they are sent. 
Figure 5, shows that effect of node speed on the end 
to end delay. In the first scenario 10 nodes with 4 
connections CBR, there is less delay with 20 speeds 
comparing with others sub -scenario 40, 60, and 80 
speeds. In addition when the number of mobile nodes 
(MNs) is increased to 80 nodes and 40 CBR 
connections with respect increase node speed up to 
80 m/s the end to end delay increases because of the 
time consumed for route discovery and the increasing 
number of packets in the buffer. However, when the 
pause time is 40 s and speed 20 m/s, the network is 
stable and the end to end delay decreases. With 
normal speed, the end to end delay is low because the 
network is not congested. 
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Fig. 5. Speed vs. Average end to end delay 
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Fig. 6. Speed vs. Routing overhead 
iv. Normalized Routing Load (NRL) 

In figure 6, the value of normalized routing load versus 
node speeds are plotted. From the figure 7, this is clear 
that DSR protocol performs well in the first scenario 
with node speed 20 m/s. Because of the NRL direct 
proportional with overhead and sending packets. NRL 
represents the number of routing packets transmitted per 
data packet delivered at the destination. This metrics 
checks the efficiency of the DSR protocols, meaning 
that with low NRL, DSR perform well. 
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VIIL CONCLUTION AND FUTURE WORK 

The most important parameter of a mobility model 
is a node speed, either in the form of a constant value 
or in the form of a certain distribution. In this paper, 
we have present our work on evaluating performance 
of DSR protocol under widely and simplest mobility 
model which called Random Waypoint with respect 
to the effect of node speed for different scenarios. 
Simulation result shows that node speed has 
significant on the performance of DSR protocol 
because mobility models are characterized by the 
movement of their constituents. The nature of 
movement its speed, direction, and rate of change 
can have a dramatic effect on protocols and systems 
designed to support mobility. The Mobile nodes 
(MNs) randomly select the next destination in the 
simulation area and choose a speed uniformly 
distributed between the minimum and maximum 
speed and travels with a random speed which is 
chosen uniformly. In addition, results shows that 
average optimal setting for our scenarios is when 
node speed is 20 m/s. The experimentation suggests 
that several parameters such as maximum speed, 
node density pause time and traffic source of nodes 
also affect the routing performance and need to be 
investigated with various mobility models. 
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Abstract: Among new software development processes, 
Agile Software Development (ASD) gives the software 
industry a new idea of quick and timely delivery of 
product. Agile methodologies got overwhelming response 
by all levels of software organizations. But limited scope 
of software designing and reusability of components do 
not let it to be made first choice of software development 
process and professionals. Agility addresses 
Multidimensional constraints like software design and 
Reusability, architecture and risk, iterations and 
changeability. Rapid development combined with 
changeability at later phases adds charm to ASD but 
missing designing and reusability act as a hurdle. 
Popularity of any software product is actually in length 
of its stay in market that ofcouse yields them rewards in 
terms of money compared to their investments. Agility's 
approach of development towards specialized 
components also lessens their probability of staying long 
in market. This paper aims to find how reusability by 
adding a bit of designing and developing specialized cum 
generalized components can be achieved in ASD. 

Introduction: Agile Software Development methods 
and techniques are being followed in the industry from 
the last decade to get quality product and to reduce 
development time. Rapid development and accommodate 
changes at any level of development gives the 
competitive advantage to the Agile processes over 
Traditional processes. But to get best the combination of 
both the processes is required. A proper degree of 
specialization and generalization needed to be 
maintained. Inclusion of architecture specific designing 
in ASD can make it a reliability prone approach i.e. ASD 
without risk. 

Reusability also contributes towards quality product and 
the rapid development. [19] reveals that Japanese 
projects also exhibited higher levels of reuse while 
spending more time on product design as compared to 
American teams which spend more time on actual coding 
and concludes that Indian firms are doing great job in 
combining conventional best practices, such as 
specification and review, with more flexible techniques 
that should enable them to respond more effectively to 
customer demands. If such a trend is replicated across the 
broader population, it suggests the Indian software 
industry is likely to experience continued growth and 
success in future. 
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Figure 1 : The Evolution of software process models [6] 

Figure 1 shows the evolution of software development 
processes. Waterfall model was being followed where 
requirements are fixed and the next phase starts when the 
earlier one finished. It's representative of the traditional 
methods. To overcome the limitations of waterfall model, 
evolutionary model and spiral model comes into picture 
where prototype is first made and then that is converted 
to the working software. But all have one common 
limitation that no process could handle the change of 
requirements at later phases. Agile development which 
include many methodologies as XP, SCRUM, Lean 
Software Development, FDD,DSDM is being accepted 
in industry because of adaptation to change even at the 
later stages of the development and also for rapid 
development. 

Any method to be agile the values and principles of the 
Agile Manifesto (Agile Alliance 2001) set out the central 
elements of agility. "We are uncovering better ways of 
developing software by doing it and helping others do it. 
Through this work we have come to values: 
Individuals and interactions over processes and tools 
Working software over comprehensive documentation 
Customer collaboration over contract negotiation 
Responding to change over following a plan 
That is, while there is value in the items on the right , we 
have the items on the left more." [agilealliance.org] 
The twelve principles of agile software development 
(Agile Alliance 2001) are: 

1) The highest priority is to satisfy the customer through 
early and continuous delivery of valuable software2) the 
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welcoming of changing requirements, even late in 
development, for the benefit of the customer's 
competitive advantage, 3) frequent delivery of working 
software, the release cycle ranging from a couple of 
weeks to a couple of months, with a preference for a 
shorter timescale,4) daily collaboration of business 
people and developers throughout the project,5) building 
of projects around motivated individuals by offering 
them an appropriate environment and the support they 
need, and trusting them to get the job done, 6) emphasis 
on face-to-face conversation for conveying information 
and within a development team, 7) working software is 
the primary measure of progress, 8) agile processes 
promote a sustainable development pace for the 
sponsors, developers, and users, 9) continuous attention 
to technical excellence and good design enhances agility, 
10) simplicity is essential for maximising the amount of 
work not having to be done, 11) self-organising teams 
give best results in terms of architectures, requirements, 
and designs, 12) regular reflection of teams on how to 
become more effective, and tuning and adjusting its 
behaviour accordingly. 

These days extensive research is being carried out to get 
best of agile development as [9] shows in a tree that no 
agile process follows all the principles. Lean software 
development has five bottlenecks , XP itself has two , 
SCRUM has two and FDD also has seven bottlenecks. 

As many authors say that agile development becomes 
industry standard but Agile processes also have 
limitations as[8]discusses the limitations of agile on its 
1 1 assumptions which says none of the agile processes is 
a silver bullet to fit all these assumptions. 
To get best traditional approach and agile approach has 
to combine. [7] says that the companies quite expertly 
combine agile and traditional practices and adjust their 
practices according to the situation at hand. Figure 2 
shows the effects (benefits or drawbacks) in both the 
methods and also insists on the cumulative methods 
development since there has been a movement from no 
methods, via tradidional method to agile method. 
Research is going on to combine agile with other 
processes , models .One is in [11] which concludes 
through a case study that SPLE(software product line 
engineering ) and agile software development are 
complementary to each other. 



Agile 

methods 



Traditional 

methods 




Step 1 . Rate the project's environmental, agile, and plan- 
driven risks. If uncertain about ratings, buy information 
via prototyping, data collection, and analysis. 



Step 2a. If agility risks dominate plan-driven risks, go Risk- 
based Plan-driven. 



Step 2b. If plan-driven risks dominate agility risks, go Risk- 
based Agile. 



Step 1 If parts of the application satisfy 2a and others 2b, 

architect the application to encapsulate the agile parts. 
Go Risk- based Agile in the agile parts, and Risk- 
based Plan-d riven elsewhere. 



Step 4. Establish an overall project strategy by integrating 
individual risk mitigation plans 



Step 5. Monitor progress and risks/opportunities, readjust 
balance and process as appropriate. 



Table 1: Summary of Risk based approach.[15] 

Table 1 shows a risk based approach to develop a 
balanced development strategy. [16] discusses about the 
process which appears to be generic i.e. amenable to use 
for building any type of system, including web 
applications; in a context where risk analysis is 
important. 

One key research area related to agile processes is in 
software process improvement. [18] reflects such a need 
as in Table 2 the differences between traditional and agile 
software development approaches gives an iterative 
process improvement technique as a solution with five 
case studies. 
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Figure 2: Relation between identified effects [14] 



Table 2: Underlying differences of traditional and agile 
software development and SPI [18] 

[10] shows how the CMMI could be used in assessing 
agile software development or in the situation where 
organization is planning to change its process towards 
agility. Following Table 3 concludes that "While CMMI 
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creates an organizational discipline; XP eases the daily 
life by providing pragmatic, end-result-oriented 
practices. CMMI and XP can be used together very well 
and their synergy is very strong." 
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Table 3: Relationship between some of the CMMI 
process and some XP practices [17]. 

Among the limitations of agile methods mentioned in 
[12] one is the lack of attention to design and 
architectural issues. Boehm has done great work on 
architecture and pointed out that there is a risk of 
architectural mistakes that cannot be detected easily by 
external reviewers due to lack of documentation in agile 
development. 

Software Architecture and Agile Software 
Development: 

Software architecture of a program or computing system 
is the structure or structures of the system which 
comprise software elements , the externally visible 
properties of those elements and the relationships among 
them. [23] 

To accommodate changes at any level of development 
results in compromise on quality in lightweight 
processes. Moreover agile development produces 
specified products. Our interest is in how much agile can 
contribute to produce generalized products, reusable 
artifacts. [20] maintains that these two (agile approaches 
and software architecture) seemingly opposing views to 
software engineering can be integrated but it requires that 
experts from both fields work together to overcome 
evident challenges in bridging these two paradigms 
together and insists on the need of research on 
integrating architecture-centric methods in agile 
approaches. 

In architecture oriented agile software development , the 
main considerations are in which iteration the 
architecture will be designed, how much extra cost has to 
bear, is customer interested in architecture development. 
Future research is open in architecture oriented agile 
software development to get answers to these questions 
so as we can get more quality product and more 
productivity. 

Moreover, Software architecture research aims at 



reducing development costs by identifying 
communalities among closely related products. Software 
architecture entails the principal design decisions 
concerning the system and is rather orthogonal to the 
development process [2]. Architect has to detect non- 
functional requirements from the requirements stated by 
the customer. He has to work with His vision. [23] shows 
there could be quality attribute trade -offs which should 
be taken care off. It is also point of consideration that in 
agile development the architect is one of the developer's 
team or an individual one. 

Though introduction of architecture reduces many risks 
but it also introduces many risks called architectural 
risks. [21] Summarizes that architectural risks in agile 
processes can be handled by two ways. Architectural 
risks that we know in advance can be handled in a time 
boxed iteration zero, where no features are planned to 
delivered. Small architectural risks can be handled as 
they arise during iterations, but large architectural risks 
must be promoted to be on par with features, and inserted 
into a combined feature and risk backlog . 
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Figure 3: Feature and Risk Backlog based ASD[21] 

One way to incorporate risk into an agile process is to 
convert the feature backlog into a feature and risk 
backlog. The product owner adds features and the 
software teams add risks. The software team must help 
the product owner to understand the technical risks and 
simply prioritize the backlog [21]. 

Architectural Patterns: An architectural pattern is a 
description of element and relation types together with a 
set of constraints on how they may be used. [23] Some 
authors specify the patterns what D.Garlan and His 
collaborators call styles. In most cases arch patterns are 
considered in close connection with object orientation. 
Object oriented language constructs like abstract classes 
or inheritance, which support the architectural pattern 
idea in a very elegant way. [4] 

Pattern is being decided on the non-functional 
requirements of the product. Single pattern or 
combination of patterns is being used to design the 
architecture by the architect by keeping in mind the 
hindrance to each of the non-functional properties 
because of one another. 

As patterns are already fully tested and can be easily 
adapted ,enhances the reusability . One negative point 
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attached to it is that Pattern reuse depends upon several 
factors. [2] 

From the software evaluation point of view, if the 
abstract scenarios that characterize the quality attributes 
satisfied by the patterns used in the software are available 
,it will improve software architecture evaluation and 
reduce the time and resources required to gather 
scenarios from scratch for each evaluation effort. [3] 

Reusability: Number of techniques are available to 
support reusability. Considerable research and 
development is going on in reuse; industry standards like 
CORBA have been created for component interaction; 
and many domain specific architecture, toolkits, 
application generators and other related products that 
support reuse and open systems have been developed]!]. 
Architecture reusability can be increased by defining 
levels as in FIM architecture [5] which operates at three 
different levels of reuse: Federation, domain and 
application. 

Agile software development and reusability in the 
software engineering lays the same foundation of the 
quality product and reduction in development time. To 
get a generalized product through agile processes 
software architecture has been introduced which supports 
modifiability also. A repository can be built to place 
various artifacts like patterns, components, and reference 
architectures in ASD. 

Focus on how a non-functional property reusability 
relates to the software architecture of a system [13]. [22] 
suggested software process model for reuse based 
software development approach. 

Conclusion: Reusability reduces the complexity of 
design process. Introduction of software architecture 
reduces the risks and increases the modifiability at later 
stages even. It is concluded that agile development which 
has promising future in the software industry and can 
fulfill the demands of the industry can be improved more. 
Adding a slight touch of traditional approach, Object 
oriented patterns for reusability at design, code and test 
level, and architecture specific designs will definitely 
make space for reusability and reusable artifacts. 
Architecture oriented agile development is an open 
research area. Designing and developing specialized cum 
generalized components can be achieved by using object 
oriented patterns that is a new dimension to be explored 
more for effective ASD. 
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Abstract — Intrusion Detection Systems (IDS) are one of the 
well-known systems used to secure the computer environments, 
these systems triggers thousands of alerts per day to become a 
serious issue to the analyst, because they need to analyze the 
severity of the alerts and other issues such as the IP addresses, 
ports and so on to get better understanding about the relations 
between the alerts. This will lead to have a better 
understanding about the attacks. This paper Investigates the 
most popular aggregation methods, which deals with IDS 
alerts. In addition, we propose Time Threshold Aggregation 
algorithm (TTA) to handle IDS alerts. TTA is based on time as 
a main component to aggregate the alerts. On the other hand, 
TTA supports aggregating alerts without threshold, which can 
be done by setting the threshold value to 0. 



Keywords — Intrusion Detection System, 
Redundant Alerts, Alert Aggregation. 



False Positive, 



I. 



INTRODUCTION 



The reason behind creating intrusion detection systems 
(IDS) is because of the huge amount of threats and attacks 
over the internet and wide networks. In the other, hand IDS 
triggers huge amount of alerts because of these threats; 
therefore managing and controlling these alerts need to be 
studied which led the researchers to investigate these alerts to 
create methods and techniques such as aggregation to 
minimize the amount of alerts and group them to make them 
fewer and to reduce the analyzing process time. Such a 
progress like this directed to minimize the false positive of 
IDS too. A good knowledge of IDS and their alerts should be 
known for better understanding of the aggregation technique. 



A. Intrusion Detection System (IDS) 

IDS as a system triggers alert or a group of alerts if there 
is an intrusion of the monitored network based on analyzing 
the activities, these activities are collected from the network 
packets stream. IDS has two ways of detecting intrusions 
either by using anomaly [1] technique or misuse technique 
[2] or by merging both techniques starts by checking whether 
the attack signature saved in the database as a misuse 
technique then apply the anomaly techniques to check if it is 



anomaly attack. Misuse detecting techniques look for a 
malicious signature or pattern of the threat based on a set of 
rules or signatures to detect intrusive behavior while anomaly 
detection technique determines the abnormality of network 
flow by measuring the distance between the suspicious 
activities and the norm based on a chosen threshold. The 
main differences between these two techniques are based on 
detecting the novel attacks and the false positive rate, where 
anomaly techniques can detect novel attacks and they have a 
high rate of false positive, misuse techniques in the other 
hand have low rate of false positive without the ability of 
detecting novel attacks. To differentiate between these two 
techniques and have a better background you may refer to[3- 
6]. 



B. IDS Standard Alerts Format 

There is a variety on the sensor types, these sensors 
trigger a non standard formats of alerts, which led to create 
the standardization format. One of these standards is the 
Intrusion Detection Message Exchange Format (IDMEF). 
This standard was built with Extensible Markup Language 
(XML) and it has the flexibility to accommodate different 
needs [7] . 

II. AGGREGATION TECHNIQUES 

Aggregation technique is one the major parts of IDS 
studies for grouping and minimizing the alerts to ease the 
process of analyzing them by removing the redundant alerts. 
Aggregation techniques group the IDS alerts based on the 
similarity of the alert features, since some of the alerts related 
to one event usually they have similar features, so they will 
be aggregated into one alert. This paper will try to give the 
answers of the following questions: how to define the alert 
features? How to calculate the similarity of them? 

Valdes [8] proposed an aggregation algorithm by including 
the five features: source IP addresses, source ports, 
destination IP addresses, source ports and alert generation 
time. The compression result of each feature is a value 
between and 1, while the similarity calculation and the 
weights of each feature depend on predefined values. But the 
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researcher didn't mention the method of defined the 
similarity and the weights values. Another proposed solution 
by [9] was based on the exact matching which gives us the 
result of or 1, so this algorithm is weak because it reduces a 
little amount of alerts. Another approach based o [8] 
algorithm's done by [10] give us slightly different 
experiment results because he used only the source IP 
addresses and alert generation time. 



Different aggregation technique is introduced by [11], this 
technique aggregates the alerts by categorizing their features 
into four classes, then a similarity operator used to compute 
the similarity of the same features class, but there is no 
discussion on the computation methods for each features 
class. Oliver [12] suggested that the alerts should be 
categorized by attack intensions basis, using the subsequent 
aggregation processes. Investigating the ideas from the 
previous proposed methods leads to this proposed algorithm. 
Time threshold aggregation algorithm (TTA) works on 
extracting the features' alerts to categories them into groups 
based on the similarity on these features maintaining the 
integrity of the alerts' features. 



False Alert 



New 
Alerts Group 



If 

Ri == Ri+1 
For all j 



Compare |Ri-Ri+1| 



Set Threshold 
Value Th_T 



Read Alerts as 

n=(Ri, R2, ..., Rn) 



Figure 3.2 TTA Algorithm 



III. PROPOSED ALGORITHM 

As mentioned in previous sections, based on some of 
alerts' features or all of them, the researchers are trying to 
make their aggregation algorithms without any consideration 
on the alerts' trigger time. In this proposed algorithm, the 
merging of any two alerts or more will be based on a 
threshold value, which should give more accuracy 
combination results. Figure 3.1 explain the TTA algorithm. 

I. Time Threshold Aggregation algorithm 

TTA works as illustrated in following: 

(1) Read IDS alert as n = (R b R 2 , . . ., R n ) 

(2) Get the first row items as Rj where i = { ji, j 2 ,.•> js } 

(3) Set the Threshold Th_T value 

(4) Iteration I = n-1 

(5) Compare the Rows by \R t — R i+1 \ 

(6) Update the R iQ Value 



Ri 8 - Ri+ljf Ri+U 



Ri+i 2 , - , Ri+i 8 - 



fori!,..., i 7 =0&|flj <Th_T) 



(7) WhileI>lDo 

(8) Delete R i+1 if(^ +1 . 

(9) 1 = 1-1 

TTA based on the Row Echelon Form [13] Concept, in 
TTA we conseder the redundunt alerts as false positive 
alerts if there is an exact matching (Rj = R i+ i) for ii,..., i 8 , 
and if the different is only in the time feature is repeated i 8 
we conseder it as a real alert. 



To understand the algorithm of ATT, check the 
following Example: 

(1) Let the sample of the A took from the table 3.1 
and the Th_T = 2. From table 3.1 we get Ai = {4, 
1, 2, 2, 3, 1, 2, 1}, A 2 = {4, 1, 2, 2, 3, 1, 2, 2}, A 3 = 
{4, 2, 1, 2, 3, 1, 3, 2}, ..., A 13 {4, 1, 2, 2, 3, 1, 2, 9}. 

Table 3.1 Example of A 
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(2) We Multiply (A 2 , . . ., Ai 3 )*-1 then do apply 



A±6 — \A _ A±6 



(1) 



A 2 = {-4, -1,- 2, -2, -3, -1, -2, -2}, ..., A 13 {-4, -1,- 
2, -2, -3, -1, -2,- 9}. After we apply equation 1 the 
result will be like this Ai = {4, 1, 2, 2, 3, 1, 2, 1}, 
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A 2 = {0, 0, 0, 0, 0, 0, 0, 1}, A 3 = {0, 1, 1, 0, 0, 0, 0, 
2}, ...,Ai3 {0,0, 0,0, 0,0, 0,3}. 

(3) After each time we apply equation 1 we check if 



A i need to be updated by 



A i 8 - A i+i 8 
(4) Equation 



(2) 



will 



be used only 
A i+ll ,A i+l2 , ... ,A i+lQ = , this equation can be 
use in the case of A 2 , A 4 , A 8 , An as {2, 3, 5, 6} in 
the case of A 13 A 13 has not been updated because 



*13, 



> Th_T. 



(4) We eliminate the zero's rows regarding ii, . . ., i 7 to 
get a new set of A as table 3.2 









Table 3.2 New set of A 
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(5) We repeat step (1, 2, 3, 4, 5) until there is no rows 
left. 



B = 



h] 




f£X2 2 1 3X3^ Al 
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We use the set final set of N to the new alerts' dataset, and 
we keep repeating the algorithm steps until we get empty N 
set and B = [i x ] = [7 1421113] 
The final aggregated file Agg will be like follow 



Agg = 



4 12 2 3 12 1 

42123132 

L7 1 4 2 1 1 1 3J 



Table 4.2 Aggregated Alerts Agg 
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II. Mathematically Proof : 
If we consider A as the alerts' dataset then 



/' 



A = 



4 
4 
4 
4 
7 
4 
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2 
1 
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2 1 

2 3 

2 1 

2 3 



2 \ 
2 2 * 



UvJ 



Where B is the set of alerts related to the hyper alerts 
(Produced alerts from A) and N is the set of alerts 
representing the false positive alerts (None related). 



From the first iteration we got z"i : 
and from second iteration we got r\i 
therefore: 



[0 01] 
[01-100011] 



III. Using Time Threshold Aggregation algorithm on IDS 
Alerts 

As mentioned in section 1 and 2, alerts contains many 
features in TTA we focus in 8 features to do the aggregation 
(Source IP, destination IP, Source port, destination port, 
Severity, Protocol, Alert Classification and Time) so if we 
took each alert as a row in our algorithm it can give a 
promising results. 



IV. USING TIME AS A MAIN FEATURE 

Most of the previous studies didn't take the alert trigger time 
as one of the extracted features. We believe the time 
threshold will effects the accuracy of the aggregation result 
by taking it as one of the aggregation features, based on the 
alert trigger time, the process of analyzing the alerts will be 
easier. The analysts would like to know the severity of the 
alerts based on the amount of the alerts by the same features 
which this algorithm can show. Based on the threshold Th_T 
that the user will select; the amount of the aggregated alerts 
will be changed. 
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V. Discussion 

This paper presented the ATT algorithm for aggregating 
alerts from any intrusion detection systems. The main 
advantage of the proposed framework is to improve the alert 
aggregation process especially when it is related to triggered 
alert time. The advantages of ATT are: to minimize the 
amount of alerts, remove the redundant alerts and to remove 
the false alerts. 



Other benefits of the proposed algorithm are: Firstly, to 
obtain the most benefit from the alerts by making use of the 
supporting features from the alerts itself which is controlled 
by the user. Secondly, by analyzing any type of alerts in a 
standard format, the algorithm provides flexibility to make 
use of the enriched alerts for aggregating purpose rather than 
any complicated techniques. Thirdly, this algorithm can ease 
the study of the alerts severity when they can be related to 
the aggregated alert groups. Finally, using this algorithm will 
give us accurate and less number of alerts since it is based on 
the threshold value which can be modified by increasing or 
decreasing it. By setting the threshold value to the alerts 
will be aggregated by exact matching. In other words, the 
aggregated alerts should be approximately aggregation 
with the same amount of output alerts, and since it is very 
hard that two alerts will be triggered in the same time from 
the same sensor or the same IDS, the amount of the 
aggregated alerts is high. By increasing the value of the 
threshold the amount of output alerts will be decreased. After 
a number of trials the user can tell what are the right value of 
threshold should be. 



VI. Conclusion and Future Work 

TTA can be used as an aggregation method to any file 
containing a group of Items with extracted features. In the 
stage of programming TTA, it should give the user the 
ability to choose the number of extracted features from the 
IDS alerts. TTA can be implemented in using parallel 
technique for the comparison part between the alerts to 
give a better time results. 
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Abstract — Machine vision for industry has generated a great 
deal of interest in the technical community over the past 
several years. Extensive research has been performed on 
machine vision applications in manufacturing, because it has 
the advantage of being non-contact and as well faster than the 
contact methods. Using Machine Vision, it is possible to 
evaluate and analyze the area of the surface, in which machine 
vision extracted the information with the help of array of 
sensors to enable the user to make intelligent decision based on 
the applications. In this work, Estimation of surface roughness 
has been done and analyzed using digital images of machined 
surface obtained by Machine vision system. Features are 
extracted from the enhanced images in spatial frequency 
domain using a two dimensional Fourier Transform and 
Wavelet Transform. An artificial neural network (ANN) is 
trained using feature extracted values as input obtained from 
wavelet Transform and tested to get R t as output. The 
estimated roughness parameter (R t ) results based on ANN is 
compared with the R t values obtained from Stylus method 
and the best correlation between both the values are 
determined. 

Keywords — Surface roughness, Machine vision, Milling, 
Grinding, Wavelet Transform, Neural Network. 

I. Introduction 

The quality of components produced is of main concern to 
the manufacturing industry, which normally refers to 
dimensional accuracy, form and surface finish. Therefore, 
the inspection of surface roughness of the work piece is 
very important to assess the quality of a component, which 
is normally performed using stylus type devices, which 
correlate the vertical displacement of a diamond-tipped 
stylus to the roughness of the surface under investigation. 
But, the limitations of stylus techniques have already been 
reported in detail in [6, 5, 4]. Machine Vision typically 
employs a camera, a frame grabber, a digitizer and a 
processor for inspection tasks where precision, repetition 
and/or high speed are needed. The histograms of the surface 
image have been utilized to characterize surface roughness 
and quality. Fourier transform (FT) of the digitized surface 
image in which the magnitude and frequency information 
obtained from the FT are used as measurement parameters 
of the surface finish. These methods use the basic 
assumption that the surface of the specimen is completely 
flat and there is no inclination when the images are 
captured. Even a small inclination of the specimen may 



result in inconsistent estimation of roughness of 
components using machine vision primarily due to the fact 
that illumination, shadow on the images is likely to be 
different. 

In this work, the machined surfaces are captured 
using a Machine Vision system. Following the image 
enhancement, the features are extracted and then the 
roughness parameters are estimated and analyzed. Here 
wavelet is used to extract the features of the enhanced 
image, and an artificial neural network (ANN) is developed 
to predict the surface roughness. The results are compared 
with that obtained using the standard stylus method. 



II. Roughness parameters 

The machined surfaces are generally characterized by three 
kinds of errors (i) form errors, (ii) waviness, and (iii) 
surface roughness. The concept of roughness is often 
described with terms such as 'uneven',' irregular', 'coarse 
in texture', broken by prominences', and other similar ones 
(Thomas, 1999). Similar to some surface properties such as 
hardness, the value of surface roughness depends on the 
scale of measurement. In addition, the concept roughness 
has statistical implications as it considers factors such as 
sample size and sampling interval. The one parameter that 
is standardized all over the world and is specified and 
measured far more frequently than any other is the 
arithmetic average roughness height, or Roughness 
Average. Universally called Ra, it was formerly known as 
AA (Arithmetic Average) in the United States and CLA 
(Center Line Average) in the United Kingdom. It is defined 
as the arithmetic mean of the departures of the profile from 
the mean line. 

Rq (or also known as RMS) is the root mean-square 
average of the departures of the roughness profile from the 
mean line. Rq has statistical significance because it 
represents the standard deviation of the profile heights and 
it is used in the more complex computation of skewness, 
the measure of the symmetry of a profile about the mean 
line. 



^w^y 



...(i) 
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A. Profiles for Turned Machined Components 

Figure 1(a) and figure 1(c) shows the profiles obtained 
for a turned component with a stylus instrument. Similarly, 
figure 1(b) and 1(d) shows the gap profiles obtained for the 
same turned components by diffraction method. In both 
graphs, 'z' is the deviation of the points on the profile from 
the mean-line. It can be observed that appreciable 
differences in the diffraction pattern are seen for large 
variations in the gap and therefore good comparison of 
results is guaranteed in both only for turned components of 
medium roughness. For very rough surfaces scattering is 
observed. A limitation in the usage of the different methods 
is that the smoothness of the edge plays a crucial role in the 
evaluation of the finish of the components. 
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Fig. 1(a), (c) The profiles obtained for a turned component 
with a stylus instrument, (b), (d) the gap profiles obtained 
for the same turned components by diffraction method. 



B. Profile for Ground Machined Components 
Figure 2(a) shows the profiles obtained for a ground 
component with a stylus instrument. Similarly, figure 3(b) 
shows the gap profiles obtained for the same ground 
components by diffraction method. In both graphs, 'z' is the 
deviation of the points on the profile from the mean-line. 
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Fig. 2(a) The profile obtained from a ground component 
with a stylus instrument, (b) The gap profile obtained for 
the same ground component by diffraction method. 

III. Spectrum techniques for feature extraction 

A. Fourier spectrum 

The Fourier spectrum is the frequency domain 
counterpart of the autocorrelation function. The FT of the 
correlation is used, which corresponds to the power spectral 
density function and describe how the power in a signal is 
distributed over frequency. The power spectrum can reveal 
the presence of offset, or periodic structures in a data set. 

B. Wavelet Transform (WT) 

The wavelet is a tool in surface texture analysis and can 
decompose a surface into multi-scale representation in a 
very efficient way. The wavelet transform (WT) is a 
mapping of the signal to the time-scale joint representation. 
By WT, the decomposition of a signal with a real 
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orthonormal bases ^ mw (x) obtained through translation and 
dilation of a kernel function ^x) known as mother wavelet 
as given in eqn. [2], 

V v ix) = 2-' 2 yi2-~x-m) (2) 

Where, m,n are integers. To construct the mother wavelet 
^x), it is required to determine a scaling function ty(x) 
given in eqn. [3], 

<f>(x) = 4l Y, h(k)$(2x - k) 

k ...(3) 

Then, the mother wavelet ^x) is related to the scaling 
function as in eqn. [4], 

Y{x) = J2Y J g(k)(j>{2x-k) 

t ...(4) 

where, 

g(k) = (-l) h h(l-k) 

The coefficients h(k) have to meet several conditions for the 
set of basis wavelet functions to be unique, be orthonormal 
and also have a certain degree of regularity. 

C. Wavelet Transform for Signals 

In two dimensional cases, the one dimensional wavelet 
transforms are applied along both the horizontal and 
vertical directions (j)(x) is a one dimensional real, sequence 
integral scaling function defined as in [5] 

<f> jk (x) = 2^<f>(2 j x-k) ...(5) 

Translation k determines the position of this one 
dimensional function along the x- axis, scale j determine its 

j/ 
width along x axis and 2 /2 controls its height and 

amplitude. This one dimensional scaling function satisfies 
these conditions: 

■ </> k is orthogonal to its integer translates. 

■ The set of functions that can be represented as a 
series expansion of (/> . k at low scale is contained 

within those at higher scale. 

So, the difference between any two sets of (fij^ is 

represented by a companion wavelet function lf/j k defined 

V 

in eqn. [6], y/ k (x) = 2 /2 i//(2 J x-k) 

...(6) 

Then, the 2 dimensional DWT functions are the 
linear products of scaling and wavelet functions 0( x ) 
and y/(x) yielding the eqn. [7] through eqn. [9]. 
y/ H (x,y) = y/(x)4{y) ...(7) 

ir r (x,y) = Kx) ¥ (y) ...(8) 



where, y/ H ( x , y) , i// v (x,y)mid y/ D (x, y) are called the 
horizontal, vertical and diagonal wavelets. Thus, DWT is 
well localized and allows decomposition in three directions, 
namely, horizontal, vertical and diagonal respectively. 

D. Features of Wavelets 

In this application, the features are extracted using a 
wavelet which belongs to a family of orthogonal wavelets. 
The mother wavelet (DB4), its corresponding scaling and 
wavelet functions and the decomposition filters are shown 
in Figure 3 and Figure 4 respectively. 
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y/ D {x,y) = y/(x)i//(y) 



... (9) 



Fig. 4 Decomposition of low-pass filter h ^(-n) and high- 
pass filter h w (-m) 

The DB4 scaling function is given by 

a l =h s 2l +h l s 2l+l +h 2 s 2l+2 +h 3 s 2l+3 ...(10) 

a[z'] = V[2z']+V[2/+l]+/z 2 5 , [2/+2]+V[2z'+3] •■• (11) 
The Daubechies DB4 wavelet function is given by 

C i = &0 S 2i ~*~ &l S 2i+l ~*~ §2 S 2i+2 "*" &3 S 2i+3 "• v^) 

c[/] = g j[2/] + g 1 j[2/+l] + & j[2/+2] + g 3 j[2/+3] .-.(13) 
IV. Neural networks for surface roughness 

ASSESSMENT 

The roughness features extracted from the machined 
images, are fed as input to an ANN to predict the roughness 
value R t ANN consists of a number of elementary units 
called neurons. A neuron is a simple processor, which can 
take multiple inputs and produce an output. Each input into 
the neuron has an associated weight that determines the 
"intensity" of the input. The processes that a neuron 
performs are: multiplication of each of the inputs by its 
respective weight, adding up the resulting numbers for all 
the inputs and determination of the output according to the 
result of this summation and an activation function. Data is 
fed into the network through an input layer, it is processed 
through one or more intermediate hidden layers and finally 
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fed out of the network through an output layer as shown in 
Figure 5. 




Input Layer Hidden Layer Output t.ay^r 

Fig. 5 Typical ANN network 

V. Proposed System for Surface Roughness 

EVALUATION 

The methodology and block diagram of proposed Machine 
vision system is shown in Figure 6(a) and Figure 6(b). 
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components). Calculate the weighted standard 
deviations of three detailed images. 
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...(14) 



m M 



where, « = Standard deviation of the M detail image at i 
Level 
M=H(Horizontal)/V(Vertical)/D(Diagonal) 
component 

The standard deviation of each sub image at level i is 
weighted by the factor (1/2/-1), 

(iv) Repeat steps 1-4 four times for original image 
and images at orientation 90°, 180°, and 270 ° 
(achieved by rotating original image). 

The final feature set consists of 4*(3L) features. 



Fig. 6 (a) Block diagram of proposed system 
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Fig.6 (b) Methodology in the proposed computer vision 
system for measuring surface roughness 



A. Algorithm for feature extraction 
(i) 
(ii) 



(iii) 



Carry out image enhancement of machined 

image 

Subject the enhanced image to a L-level 

discrete wavelet decomposition. 

At each level (i=l, 2, ... L), there are four 

sub-images. One approximation image and 

three components/images (LH, HL and HH or 



B. Wavelet based Feature Extraction 

Since, the wavelet coefficient are orthogonal, the 
original profile can be re-obtained after wavelet 
decomposition by simply adding the sub-scales signals as 
shown in Figure 7. Furthermore, using this simple 
summation technique the concepts of roughness, waviness 
and form can be preserved. This is reflected in Figure 8 
where, an arbitrary decomposition of a surface texture is 
obtained by casting into three frequency components, 
representing the form, waviness and roughness, using 
Daubechies wavelet of order 20. A dimensional step can 
now be cleared. Indeed, the same kind of decomposition 
process can be performed using images instead of profiles, 
because surface roughness can be measured precisely using 
for instance optical surface measurement systems. The 
arbitrary decomposition into form waviness and roughness 
of surface textures obtained by casting, grinding and 
vertical milling respectively, using wavelet of order 20 is 
shown. The roughness average of each component (i.e. 
form, waviness and roughness) is also shows in order to 
illustrate the roughness scale. The measured area is of a few 
millimeters square. Hence, the wavelet tool allows the 
decomposition of surfaces into form, waviness and 
roughness components and can successfully replace 
standard filters that are commonly used in surface texture 
characterization and hence, give a solid theoretical base for 
the standardization of these filters. 
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Fig. 7 Multiscale decomposition of a surface texture profile 
obtained by casting under seven different scales using the 
wavelet of order 20 
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Fig. 8 Multiscale decomposition of a surface texture profile 
obtained by casting under three different components (form 
waviness and roughness) using the wavelet of order 20. 

In Figure 9, the FNWT maxima indicate at each scale the 
location of a frequency component. Those features can also 
be quantified according to both the shape of the 
corresponding peak and its height. For an image, when 
using a multiresolution scheme for a dyadic standard 
decomposition of a function into sub-bands a filter bank 
with a power of two number of filters should be used. When 
using orthogonal wavelets like ones, one can easily simplify 
the problem by gathering the channels by scale in both 
directions. This process applied to a discrete wavelet is 
called the scaled DWT. The frequency normalization can 
then be performed based on these filters. 



\\ :\:<: .-] Transform 



fd\r^I ' * VT-VV/l/ \^f OrHndprnrlle 




M 


«0 


«a 


80 


100 


iu 




20 


<0 


<m 


80 


100 


20 
















20 


m 


ea 






2u 



s.v.k' : 



20 40 SO 60 1M «0 20 <0 ft) 80 



20 40 60 BO 



V^WWa/v 




20 40 




20 40 *} 



20 10 SO 80 100 "20 



^/d4V/Av^^ 



20 40 SO SD 



120 o so id ea ao loo j :o 



Fig. 9 Comparison of multiscale decomposition of a surface 
texture profile obtained by casting under seven different 
scales using the wavelet of order 20 and its frequency 
normalized equivalent. 



C. ANN based surface roughness estimation 

ANN with a variation of the classic back-propagation 
algorithm is employed to predict surface roughness. 
Compared with more conventional approaches, ANN 
demonstrates certain advantages that make them much 
more attractive. They have the ability to recognize patterns 
that are similar, but not identical, it can store information 
and generalize it. There is no need for explicit statement of 
the problem or for a problem- solving algorithm. Due to 
their massive parallelism, ANNs display increased 
computational power that can be used to deal with complex 
problems. Back-propagation neural network used for 
estimating the surface roughness of the machined surfaces 
with is a four layer network with six nodes in the input 
layer, six nodes in the first hidden layer, five nodes in the 
second hidden layer and one single node in the output layer. 
Each layer is fully connected to the succeeding 
layer. The outputs of nodes of one layer are transmitted to 
nodes in another layer through links. The structure of an 
ANN is shown in Figure 10 where the Energy maps are fed 
as inputs into the trained neural network and the surface 
roughness parameter (R t ) is estimated . In the training 
phase, the desired value of the node in the output layer is 
the actual roughness value, R t obtained by stylus method. 
The ANN adjusts the weights in all connecting links such 
that the mean square error, i.e. the averaged squared error 
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between the network output and the desired output is 
minimized. Training of ANN is stopped as soon as the 
specified number of epochs has reached and the values of 
weights corresponding to the minimum error are restored. 
Once trained, the ANN is then tested for different sets of 
input data. In the testing phase of the neural network, the 
predicted roughness, R t is the value of the node in the 
output layer. 




V- llidikn Layer 



2 sl Midden Layer 

I? rudcyl 



Output Layers 



Fig. 10 The System architecture of ANN used for predicting 
Ra for surfaces 



VI. RESULTS AND DISCUSSION 

Case 1: Feature Extraction using Scaled DWT 

The blocks contain time reversed scaling and wavelet 
vectors. The h^i-n) and h^(-m) are low pass and high pass 
decomposition filters. Blocks are containing a down arrow 
and represent down sampling extracting every other point 
from a sequence of points. Each pass through the filter bank 
in Figure 1 1 decomposes the input signal into four lower 
resolutions (or lower scale) components. The W$ 
coefficients are created by two low pass (h^ based) filters 
and are thus called the approximation coefficients and { W$ l 
for i = H, V, D} are the horizontal, vertical and diagonal 
detail coefficients. 
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Fig. 1 1 2D DWT filter bank 

Mathematically, the series of filtering and down sampling 
operations are used to compute the DWT coefficients W$ 
(j,m,ri) and {Wj(j,m,ri) for i = H,V,D} at scale j. 



<m=2fe)ti>fl (14) 

/(m=2^>0 (15) 

f(m=2§Jc>0 (17) 

In Figure 12, f(x,y) is the highest resolution representation 
of the image being transformed. It serves as the input for 
the first iteration and for the succeeding iterations; the 
approximation coefficients W^(j, m, ri) are given as input to 
the filter bank, to obtain the next set of wavelet coefficients. 
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Fig. 12 Sub-band image decomposition for wavelet based 
feature extraction 



Thus the energy for each subband is calculated up to 4 
levels of decomposition and the image features Et, Eh, Ev 
and Ed are obtained from the energy map which is 
determined using tree-structured wavelet transform for each 
image. Few Sample enhanced machine images [Figure 
13(a) to 18(a)] are applied with DWT and the respective 
transform outcomes are shown in [Figure 13(b) to 18(b)] 
along with the energy details in Table 1 . 




Figure 13(a) 
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Figure 16(a) 



Figure 13(b) Transform absolute coefficient 




Figure 14(a) 
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Figure 14(b) Transform absolute coefficient 




Figure 15(a). 




Figure 16(b) Transform absolute coefficient 




Figure 17(a). 
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Figure 17(b) Transform absolute coefficient 
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Figure 15(b) Transform absolute coefficient 



Figure 18(a) 
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Table 2 ANN estimated R t for milling parameters (Image 
enhancement done and image features extracted using FT) 



Figure 18(b) Transform absolute coefficient 
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Fig. 
Name 


Ea 
Values 


Edetail Values 


Et 


Eh 


Ev 


Ed 


13(a) 


99.0286 


0.0396 


0.1714 


0.4482 


0.3122 


14(a) 


99.0088 


0.0204 


0.1183 


0.3862 


0.4663 


15(a) 


97.7414 


0.0272 


0.2686 


0.6214 


1.3414 


16(a) 


98.5546 


0.0393 


0.3785 


0.7579 


0.2697 


17(a) 


98.5984 


0.0606 


0.2933 


0.5217 


0.5260 


18(a) 


96.8104 


0.0282 


0.1311 


0.5938 


2.4364 



Where Et is Energy total, Eh is Energy horizontal, Ev is 
Energy Vertical and Ed is Energy diagonal. Ea is Energy 
Approximation. 

Case 2: Estimation of R t using ANN 
(a) For Milled surfaces 

Two types of feature extraction and surface roughness 
estimation using ANN is performed in this work. The first 
one extracts the features using FT and the second uses the 
WT. In FT approach the key input features collected for 
training the network consist of 
(i) average grey scale value (Ga) 
(ii) major peak frequency (Fl) and 
(iii) Principal component magnitude squared value (F2). 
The WT based feature extraction is already discussed in 
case (i) of section VI. In the training phase (for both FT and 
WT) the desired value of the node in the output layer is the 
surface roughness R t obtained using the stylus method. The 
surface roughness R t from ANN along with the stylus 
measurement values for the milled samples after image 
enhancement with FT (WT) extracted features is given in 
Table 2 (Table 3). 



Test 

No. 


Machining conditions 


Features of image texture 


Stylus 


Vision 


Error 


V 

(m/s) 


F 

(m/rev) 


D 

(mm) 


G a 


Fi 


F 2 


(urn) 


R* 

(urn) 


1 


250 


40 


1.6 


0.5332 


0.5552 


0.6872 


0.3016 


0.31 


0.0084 


2 


500 


40 


1.6 


0.6605 


0.5846 


0.8016 


0.1382 


0.31 


0.1718 


3 


500 


80 


1.6 


0.5834 


0.6537 


0.7892 


0.2823 


0.47 


0.1877 


4 


1000 


40 


0.4 


0.5686 


0.4686 


0.8589 


0.0877 


0.20 


0.1123 


5 


1000 


SO 


0.8 


0.5626 


0.4342 


0.9080 


0.0567 


0.17 


0.1133 


6 


1000 


160 


0.4 


0.5844 


0.4326 


0.8979 


0.0528 


0.17 


0.1172 


7 


1000 


160 


0.8 


0.6898 


0.5864 


0.9158 


0.0260 


0.13 


0.104 


8 


2000 


160 


0.4 


0.8307 


0.8248 


0.9443 


0.0281 


0.17 


0.1419 


9 


355 


80 


1 


0.4782 


0.2627 


0.6878 


0.1217 


0.07 


0.0517 


10 


500 


80 


1 


0.4858 


0.2524 


0.7814 


0.0606 


0.08 


0.0194 


11 


710 


500 


1 


0.8218 


0.8119 


0.2649 


0.8486 


0.83 


0.0186 


12 


1400 


500 


1 


0.9804 


0.9642 


0.6850 


0.4649 


0.37 


0.0949 



Table 3 ANN estimated R t for milling parameters (Image 
enhancement done and image features extracted using WT) 



Test 

No. 


Machining conditions 


features of image texture 


Stylus 
RtQim) 


Vision 
R,(um) 


Error 


V 

(m/s) 


F 

(m/rev) 


(mm) 


Et 


Eh 


E v 


Ed 


1 


250 


40 


16 


0.9673 


0.9085 


0.7011 


0.9644 


0.3016 


0.28 


0.0216 


2 


500 


40 


1.6 


0.9800 


0.8131 


0.8597 


0.3587 


0.1382 


0.13 


0.0082 


3 


500 


80 


1.6 


0.9767 


0.7354 


0.5408 


0.6609 


0.2823 


0.27 


0.0123 


4 


1000 


40 


0.4 


0.9870 


0.8531 


0.5585 


0.2162 


0.0877 


0.07 


0.0177 


5 


1000 


80 


0.8 


0.9918 


0.5923 


0.3659 


0.1265 


0.0567 


0.04 


0.0167 


6 


1000 


160 


0.4 


0.9928 


0.5700 


0.3303 


0.1038 


0.0528 


0.04 


0.0123 


7 


1000 


160 


0.8 


0.9953 


0.4977 


0.2527 


0.0436 


0.0260 


0.05 


0.024 


8 


2000 


160 


0.4 


0.9940 


0.3938 


0.3130 


0.0743 


0.0281 


0.05 


0.0219 


9 


355 


80 


1 


0.9969 


0.1338 


0.1555 


0.0432 


0.1217 


0.11 


0.0117 


10 


500 


80 


1 


0.9968 


0.1438 


0.1688 


0.0404 


0.0606 


0.06 


0.0006 


11 


710 


500 


1 


0.9955 


0.1646 


0.1891 


0.0842 


0.8486 


0.83 


0.0186 


12 


1400 


500 


1 


0.9908 


0.2031 


0.2416 


0.2497 


0.4649 


0.45 


0.0149 



The results obtained are validated by plotting the 
correlation graph between stylus measured (conventional 
method) R r and vision measured (proposed) R t for both the 
FT and WT techniques for milled components is shown in 
Figure 15. 
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Figure 15 Comparison between predicted roughness values 
using vision approach and stylus approach for FT features 
and WT features (milling) 



(b) For grinding operations 

The R t value predicted using the trained ANN and that 
measured using the stylus approach for the grinding 
process after image enhancement with features extracted 
using FT (WT) is given in Table 4 (Table 5). 



Table 4 ANN estimated R t for grinding parameters (Image 
enhancement done and image features extracted using FT) 



Table 5 ANN estimated R t for grinding parameters (Image 
enhancement done and image features extracted using WT) 
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The results obtained are validated by plotting the 
correlation graph between stylus measured (conventional 
method) R r and vision measured (proposed) R t for both the 
FT and WT techniques for grinding components is shown 
in Figure 16. 
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Figure 16 Comparison between predicted roughness values 
using vision approach and stylus approach for FT features 
and WT features (grinding). 

VII. CONCLUSION AND FUTURE ENHANCEMENT. 

The developed model is tested online on images of 
specimens grabbed by computer vision systems with 
linearly decreasing intensity. The features of the grabbed 
enhanced image (to remove noise present in the captured 
image) are extracted using two different schemes, one using 
Fourier transform (FT) and the other using wavelet 



251 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



(IJCSIS) InternationalJournal of Computer Science and Information Security, 

Vol.8, No. 8, November 2010 



decomposition. The FT method is used to extract the 
features of image texture, namely, the major peak frequency 
F b and the principal component magnitude squared value 
F 2 . Using the wavelet (Db4) multi resolution decomposition 
algorithm, the energy details of the sub band images, 
namely, energy total (E t ), energy horizontal (E h ), energy 
vertical (E v ) and energy diagonal (E d ) are extracted. These 
extracted features of the enhanced image are given as input 
to a trained neural network (back propagation network) and 
the surface roughness parameter R t is estimated. From the 
obtained results, it is concluded that the wavelet based 
image feature extraction of the enhanced images gives 
better correlation between vision R t and the stylus R t both 
for milled and grinding surfaces. 

Future direction of research shall focus on 
implementing the proposed algorithms using high speed 
hardware units thus making the present work ideally for 
high speed real-time machine vision applications. 
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Abstract — Software development includes Requirement 
Engineering (RE) which is comprised of discovering 
stakeholders and their need, proper documentation, 
communication and subsequent implementation. It can be 
viewed as the most crucial phase as the success of the software 
depends largely on it. Requirement Engineering is receiving 
increasing attention of the researchers and also people 
associated with software development. This paper investigates 
RE activities in detail followed by some current challenges and 
also proposes some suggestions to overcome these challenging 
issues. 

Keywords- Software Requirement, Requirement Engineering, 
Requirement Elicitation, Requirement Management. 



I. 



Introduction 



Success of a software system is measured by evaluating 
how it meets the purpose it was intended and in broad sense, 
Requirement Engineering (RE) is the process of discovering 
that purpose through identifying stake holders and their 
needs, refinement of the gathered information, modeling, 
specification and then subsequent implementation. The 
system requirements and role allocated to software — 
initially established by the system engineer — are refined in 
detail. Models of the required data, information and control 
flow, and operational behavior are created. Alternative 
solutions are analyzed and a complete analysis model is 
created [1]. Donald Reifer [2] describes the software 
requirement engineering process in the following way: 

"Requirements engineering is the systematic use 
of proven principles, techniques, languages, and 
tools for the cost effective analysis, 
documentation, and on-going evolution of user 
needs and the specification of the external 
behavior of a system to satisfy those user needs. 
Notice that like all engineering disciplines, 
requirements engineering is not conducted in a 
sporadic, random or otherwise haphazard fashion, 
but instead is the systematic use of proven 
approaches." 

The Requirement Engineering process may face a 
number of inherent difficulties like: stakeholders may be 
numerous and distributed, their goals may be volatile and 
conflicting, depending on their perspectives of different 
working environment, goals may be implicit and difficult to 
articulate, and, inevitably, satisfaction of these goals may be 



constrained by a variety of factors outside their control. 
Based on these characteristics Zave [3] defined RE as: 

"Requirements engineering is the branch of 
software engineering concerned with the real- 
world goals for, functions of, and constraints on 
software systems. It is also concerned with the 
relationship of these factors to precise 
specifications of software behavior, and to their 
evolution over time and across software 
families." 

This definition is an attractive one as it highlights the 
importance of "real world goals" which motivates the 
development of a software system by referring the 
specifications more precisely. It also refers to the reality of a 
changing world and the need to reuse partial specifications, 
as done by engineers in other branches of engineering. 
Brooks [4] defined RE as a key problem area in the 
development of complex, software-intensive systems: 

"The hardest single part of building a software 
system is deciding what to build. ...No other 
part of the work so cripples the resulting system 
if done wrong. No other part is more difficult to 
rectify later." 

RE can be characterized as a branch of System 
Engineering [5] as it has to encompass a systems level view 
to deliver some systems behavior to its stakeholders. Again, 
Zave defined RE as a branch of Software Engineering and 
software systems requirements engineering has received 
special consideration possibly due to the abstract and 
invisible nature of software with the vast range and variety 
of problems that admit to software solutions. 

Whether viewed at the systems level or the software 
level, RE is a multi-disciplinary, human-centered process 
and the use of the term Engineering in RE serves as a 
reminder that RE is an important part of an engineering 
process and represents a series of engineering decisions that 
lead from recognition of a problem to be solved to a detailed 
specification of that problem followed by anchoring 
development activities, so that the appropriateness and cost- 
effectiveness of the solution can then be analyzed. 

The structure of the paper is approximately as follows. In 
sections 2 and 3 we discuss the foundation and some ground 
works needed for requirement engineering respectively. 
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Section 4 discusses about some key issues followed by the 
core activities of RE in section 5. Then, in section 6, 
challenges of RE activities are analyzed with some 
suggestions. We finish by drawing some broad and 
necessarily speculative and personal conclusions about the 
future of requirement engineering. 

II. Foundation 

RE activities play a vital role in software and systems 
engineering, and there are also many disciplines upon which 
it draws. 

In software development context, an important role is 
played by Computer Science as theoretical computer science 
provide the framework to assess the feasibility of 
requirements while the means of developing software 
solution are provided by practical computer science. Here 
logic provides a vehicle for analyzing software behavior 
which is acquiescent to formal reasoning to make the 
reasoning steps more explicit. Different logics may be used 
to express different aspects of a required system. For 
example, temporal logic can be used to describe timing 
information, deontic logic to describe permissions and 
obligations, and linear logic to describe resources and their 
use. A further advantage of specification languages 
grounded in logic is that they are potentially amenable to 
automated reasoning and analysis [6] . 

In the systems engineering context, an understanding 
and application of systems theory and practice is also 
relevant to RE [5]. This includes work on characterizing 
systems, identifying their boundaries and managing their 
development life cycle [7-8]. RE also encompasses work on 
systems analysis, traditionally found in the information 
systems world [9]. 

Usually RE activities take place in a human activity 
system, and people are the problem owner. Therefore, RE 
needs to be sensitive to how people perceive and understand 
the newly implemented computer-based system, how they 
interact, and how the environment of the workplace is 
affected by their actions. RE draws on the cognitive and 
social sciences to provide both theoretical grounding and 
practical techniques for eliciting and modeling requirements 
[6]. Cognitive psychology provides an understanding of the 
difficulties people may have in describing their needs [10]. 
Anthropology provides a methodological approach to 
observing human activities that helps to develop a richer 
understanding of how computer systems may help or hinder 
those activities [11]. Sociology provides an understanding of 
the political and cultural changes caused by computerization. 
Introduction of a new computer system changes the nature of 
the work carried out within an organization, may affect the 
structure and communication paths within that organization, 
and may even change the original needs that it was built to 
satisfy [12]. Linguistics is important because RE is largely 
about communication. Linguistic analyses have changed the 
way in which the English language is used in specifications, 
for instance to avoid ambiguity and to improve 
understandability. Tools from linguistics can also be used in 



requirements elicitation, for instance to analyze 
communication patterns within an organization [13]. 
Philosophical elements also have an important effect on RE 
as it is concerned with interpreting and understanding of 
terminology, concepts, viewpoints and goals from 
stakeholder's perspective. Such issues become important 
while requirement validation, especially where stakeholders 
may have divergent goals and incompatible belief systems. 
They also become important in selecting a requirement 
modeling technique, because the choice of technique affects 
the set of phenomena that can be modeled, and may even 
restrict what a requirements engineer is capable of observing 
[6]. 

III. Groundwork 

In the software development process, RE is considered 
as a front-end activity. Although it is usually the case that 
requirements change during development and evolve after a 
system has been in operation for some time, RE plays an 
important role in the management of changes in software 
development. Nevertheless, the bulk of the effort of RE does 
occur early in the lifetime of a project, motivated by the 
evidence that requirements errors, such as misunderstood or 
omitted requirements, are more expensive to fix later in 
project lifecycles [14-15]. 

Before a project can be started, some preparation is 
needed which are categorized as context and groundwork by 
Finkelstein [16]. This groundwork includes some 
assessment of project's feasibility and associated risks needs 
to be undertaken, and RE plays a crucial role in making 
such an assessment. Although it is often possible to estimate 
project costs, schedules and technical feasibility from 
precise specifications of requirements, risk should be re- 
evaluated regularly throughout the development lifetime of 
a system [17], since changes in the environment can change 
the associated development risks. 

Again, RE activities are performed in a variety of 
contexts, including market-driven product development and 
development for a specific customer with the eventual 
intention of developing a broader market. The type of 
product also affects the choice of method: RE for 
information systems is very different from RE for embedded 
control systems, which is different again from RE for generic 
services such as networking and operating systems [6]. So 
groundwork is essential for the identification of a suitable 
process and also for the selection of suitable methods and 
techniques for the various RE activities. 

IV. Key Issues in Requirement Engineering 

Software development organizations should keep in 
mind the following issues when they consider how to 
improve the requirements and communication for their 
projects: 

• If teams don't get requirements right, it doesn't 
matter how well they execute the rest of the 
project: The goal of every software development 
project is to build a product that provides value to 
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customers. Effective requirements definition 
enables teams to determine the mix of product 
capabilities and characteristics that will best deliver 
this customer value. An understanding evolves 
over time as stakeholders provide feedback on the 
early work and refine their expectations and needs. 
Adequately exploring and crafting requirements 
into a set of product features and attributes helps to 
ensure customer needs are being met throughout 
the project lifecycle [18]. 

Requirements definition is a discovery and 
invention process, not just a collection process: 
Teams often talk about "gathering requirements." 
This phrase suggests that requirements are just 
lying around waiting to be picked like flowers or to 
be plucked out of users' brains by an analyst. In 
reality, requirements definition is an exploratory 
activity, and requirements elicitation is a more 
accurate description than requirements gathering. 
Elicitation includes some discovery and some 
invention, as well as recording those bits of 
requirements information that various stakeholders 
present to an analyst. Elicitation demands iteration. 
Constant feedback and validation from 
stakeholders keeps communication flowing. 
Participants in a requirements elicitation discussion 
will not think of everything they will need up front, 
and their thinking will change as a project 
progresses. Teams that prepare to iterate most often 
elicit the most accurate requirements [18]. 
Customer involvement is the most critical 
contributor to software quality: Various studies 
confirm that inadequate customer involvement is a 
leading cause of failure of software projects. The 
development team will get the customer input it 
needs eventually - even if it is after a project ships. 
However, it is much cheaper - and much less 
painful - to get customer input earlier, rather than 
after product release. Customer involvement 
requires more than a workshop or two early in the 
project. It involves input from customers early and 
often in the requirements process. Ongoing 
engagement by suitably empowered and 
enthusiastic stakeholders is a critical success factor 
for software development [18]. 
Change happens; managing change is critical: It 
is inevitable that requirements will change as 
business needs evolve, new users or markets are 
identified, business rules and government 
regulations are revised and operating environments 
change. The objective of a change control process 
is not to inhibit change. Rather, the objective is to 
manage change to ensure that the project 
incorporates the right changes for the right reasons. 
Teams that anticipate and accommodate changes 
minimize disruption and cost to the project and its 



stakeholders. Further teams that can force as much 

change at the beginning of a project will have less 

change to manage over time [18]. 

• Teams are never going to have perfect 

requirements: Requirements are never finished or 

complete. There is no way to know for certain that 

teams have not overlooked some requirement, and 

there will always be some requirements that are not 

in the specification. It is also folly to think teams can 

freeze the requirements and allow no changes after 

some initial elicitation activities. Rather than 

declaring requirements "done" at some point, 

effective teams define a baseline then follow a 

sensible change control process to modify 

requirements once a baseline is established [18]. 

V. Core Activities of Requirement Engineering 

Good RE activities can accelerate software 
development. The process of defining business requirements 
aligns the stakeholders with shared vision, goals and 
expectations. Involvement of substantial user in establishing 
and managing changes to agree upon requirements increases 
the accuracy of requirements, ensuring that the functionality 
of the developed system will enable users to perform their 
essential business tasks. 

In a broad sense, Requirement Engineering 
encompasses the two major sub domains of requirement 
definition and requirement management. 

Requirement Definition is the collaborative process of 
collecting, documenting and validating a set of requirements 
that constitute an agreement among key project 
stakeholders. This phase can be further subdivided into the 
critical process areas of elicitation, analysis, specification, 
agreeing and validating requirements. 

From a pragmatic perspective, requirement definition 
strives for requirements which are validated by user and 
clear enough to the software development team to proceed 
with design, construction and testing at an acceptable level 
of risk. It is natural that, risk leads to the threat of doing 
expensive and unnecessary rework. 

Requirement Management involves working with a 
defined set of requirements throughout the development 
process of the system and its operational life. It also allows 
managing changes to that set of requirements throughout the 
project lifecycle. This phase includes selecting changes to 
be incorporated within a particular release and ensuring 
effective implementation of changes with no adverse impact 
on schedule, scope or quality of the developed system. 

An effective requirement definition and management 
solution creates an accurate and complete set of system 
requirements, while helping organizations to improve 
communications is an effort to better align it with business 
needs and objectives. The following sub sections discuss the 
core activities of RE. 
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A. Requirement Elicitation 

Requirements elicitation is recognized as one of the 
most critical, knowledge-intensive activities of software 
development [19]; poor execution of elicitation will lead a 
complete failure of the project. Since project failures are so 
rampant [20], it is quite likely that improving how the 
industry performs elicitation could have a dramatic effect on 
the success record of the industry [21]. Information 
collected during this requirement elicitation phase are 
interpreted, analyzed, modeled and validated so that a 
requirement engineer can feel confident that a complete 
enough set of requirements of a system have been collected. 
Therefore, requirements elicitation is closely related to other 
RE activities - to a great extent, selection of techniques for 
the next phases largely depends on how requirements were 
elicited. Steps in requirement elicitation phase are described 
below: 

• Defining System Boundaries: One of the key 
objectives of requirement elicitation is to define 
system boundaries i.e., to find out the problems 
that need to be solved. This definition is necessary 
to discover where the final delivered system will fit 
into the current operational environment. Without a 
clearly defined system boundary, the project is 
issuing an open invitation to scope creep. Prior to 
eliciting requirements, teams should have a clear 
understanding of system boundaries as it affects all 
subsequent elicitation efforts. 

• Defining Goals: Goals denote the objectives that a 
system should meet. Eliciting high level goals in 
the early stage of the development process is 
crucial. However, goal-oriented requirements 
elicitation [23] is an activity that continues as 
development proceeds, as high-level goals (such as 
business goals) are refined into lower level goals 
(such as technical goals that are eventually 
operational in a system) [6]. Eliciting goals 
emphasizes on defining the problem domain and 
also the needs of the stakeholders, rather than on 
possible solutions to those problems. 

• Identifying Stakeholders: Stakeholders mean 
individuals or organizations that stand to gain or 
lose from the success or failure of a system. 
Stakeholders include customers or clients (who pay 
for the system), developers (who design, construct 
and maintain the system), and users (who interact 
with the system to get their work done) [6]. It is 
essential to gain commitment from key 
stakeholders for their participation throughout the 
requirements definition and Customer engagement 
is necessary during requirements management, as 
well. [18]. Stakeholders' view is required in 
making change in decisions followed by assessing 
the impact of proposed changes and adjusting 
requirement priorities. Users themselves are not 



homogeneous, and hence part of the elicitation 
process is to identify the needs of different user 
classes, such as novice users, expert users, 
occasional users, disabled users, and so on [22]. So 
initially every software project should identify its 
key requirement decision makers as well as the 
decision-making process to ensure that the right 
people can make important and timely decisions. 

• Select Elicitation Technique: The choice of 
elicitation technique depends on the time and 
resources available to the requirements engineer, 
and of course, the kind of information that needs to 
be elicited [6]. It also depends on the extent of 
stakeholder involvement and how much access the 
analyst has to the stakeholders. To interact with 
stakeholders, requirement engineer can use 
techniques like: workshops, questionnaires, and 
interviews. Now a day, it is a common practice that 
teams want to interact with stakeholders in 
lightweight, dynamic, frequent way which is 
completely focused on the problem and thus 
collaboration has taken priority over 
documentation review. Techniques that leverage 
prototypes, mockups and screenshots are becoming 
the norm. However, for the sake of a dynamic 
development process, teams should be trained and 
proficient in a variety of elicitation techniques. 

• Exploring user scenarios and simulations: It is 
often the case that users find it difficult to articulate 
their requirements. Rather they find it more 
convenient to visually describe their business tasks, 
interaction patterns and expected product 
functionality than to define all these textually. So a 
requirements engineer can resort to eliciting 
information about the tasks currently performed by 
the users and those that they might want to perform 
[24]. After that, these tasks can be represented in 
use cases which can be used to describe the 
outwardly visible requirements of systems [25]. 
More specifically, the requirements engineer may 
choose a particular path through a use case, a 
scenario, in order to better understand some aspect 
of using a system [26]. Acceleration in visual 
techniques for requirements definition and the 
ability to tie artifacts to more traditional 
requirements is changing the way of interactions 
forming between business and development 
organizations. 

A-l. Elicitation Techniques: 

Researchers found various classes for requirement 
elicitation techniques all of which are apposite in different 
scenarios. Some of these are briefly discussed below: 

• Traditional Techniques: These techniques are 
useful for gathering generic data. Questionnaires 
and surveys, interviews, and analysis of existing 
documentation (ex: organizational charts, process 
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models or standards, and manuals of existing 
systems) belong to this group. 

• Elicitation through Groups: These techniques are 
useful for a richer understanding of need through 
exploiting team dynamics with an aim to foster 
stakeholders' need. This class includes techniques 
like brainstorming and focus groups, as well as 
RAD/JAD workshops (using consensus-building 
workshops with an unbiased facilitator) [27]. 

• Prototyping: These techniques are useful in such 
cases where there remains a great deal of 
uncertainty about the requirements, or where early 
feedback from stakeholders is needed [28]. For 
better requirement elicitation, prototyping can also 
be readily combined with other techniques, for 
instance by using a prototype to provoke discussion 
in a group elicitation technique or as the basis for 
other traditional techniques. 

• Model Driven Techniques: Techniques of this class 
is helpful in the cases where a specific model of the 
type of information to be gathered is already 
provided and this model is used to drive the 
elicitation process. Goal-based methods, such as 
KAOS [29] and I* [30], and scenario-based 
methods such as CREWS [31] belong to this class. 

• Cognitive Techniques: This group includes a series 
of techniques originally developed for knowledge 
acquisition for knowledge-based systems [32]. 
Such techniques include protocol analysis (in 
which an expert thinks aloud while performing a 
task, to provide the observer with insights into the 
cognitive processes used to perform the task), 
laddering (using probes to elicit structure and 
content of stakeholder knowledge), card sorting 
(asking stakeholders to sort cards in groups, each 
of which has name of some domain entity), 
repertory grids (constructing an attribute matrix for 
entities, by asking stakeholders for attributes 
applicable to entities and values for cells in each 
entity) [6]. 

• Contextual Techniques: This class is an alternative 
to both traditional and cognitive techniques [33]. 
These include the use of ethnographic techniques 
like participant observation, ethnomethodogy and 
conversation analysis, which result fine grained 
analysis to identify patterns in conversation and 
interaction [34]. 

Though there is a fundamental methodological disagreement 
between the proponents of contextual techniques on the one 
hand, and the traditional and cognitive techniques on the 
other, recent work has focused on the question of whether 
integration is possible [34-35]. 

B. Requirement Analysis 

Requirements Analysis is a fundamental activity in RE 
that bridges the gap between system level requirements 



engineering and software design. It allows the requirement 
engineer to refine the software allocation and build models 
of the data, functional, and behavioral domains that will be 
exploited by software. It is also beneficial for the software 
designer as it provides a representation of information, 
function, and behavior that can be translated to data, 
architectural, interface, and component-level designs. 
Finally, requirements specification after successful 
requirement analysis provides the developer as well as the 
customer with the means to assess quality once software is 
built. The subsequent steps should be followed during 
requirement analysis: 

• Creation of visual scenarios: The natural language 
requirements found in most specifications are 
mostly text based, full of ambiguities, redundancies 
and gaps. Most of the cases, it is desirable to 
represent requirements in multiple ways to give 
readers a richer, more holistic understanding. 
Visual scenarios present requirements information 
from a business viewpoint in graphical diagrams 
which allow reviewers to immediately spot missing 
requirements by examining flows, rather than 
uncovering missing requirements by reading a 
opaque textual specification. Teams may have 
better results using diagrams that communicate at a 
higher level of abstraction, so readers can get the 
big picture without getting mired in all of the 
details. 

• Creation and Evaluation of Simulations: A 
simulation is an interactive software experience 
that captures the essential flow and level of detail 
to the requirements. Simulations provide 
opportunities for everyone to interact with some 
portion of the final system which is more tangible 
than written requirements specifications. A 
simulation may look and behave like a prototype, 
but the key difference is that a simulation is not 
created by the development team. It is created as 
part of the requirements definition phase and is 
typically done without requiring any development 
skills. Simulations can leverage existing business 
data, show flow of the data and dynamically 
capture feedback that can quickly be incorporated 
into the next revision. Typically, revisions can be 
augmented during the stakeholder review - leading 
to an "Is this what you had in mind?" high quality 
interaction [18]. 

• Requirement Prioritization: The ultimate goal of a 
software development team is to create a system 
that meets the stakeholders' demands. Since there 
are usually more requirements than can be 
implemented within the limited resource, decision 
makers must face the dilemma of selecting the 
right set of requirements for the intended system. 
In order to select the correct set of requirements, 
the decision makers must understand the relative 
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priorities of the requested requirements [36]. By 

selecting a subset of the requirements that are 

valuable for the customers, and can be 

implemented within budget, organizations can 

become more successful on the market. However, 

Prioritization should be a collaborative process that 

involves both a customer and a technical 

perspective to balance customer value against cost 

and technical risk. 

Analysis techniques that have been investigated in RE 

include requirements animation [37], automated reasoning 

(e.g., analogical and case-based reasoning [38] and 

knowledge based critiquing [39]), consistency checking (e.g., 

model checking [40]), and a variety of techniques for 

validation and verification. 

C. Requirement Specification 

Software Requirement Specification (SRS) is basically 
an organization's understanding (in writing) of a customer or 
potential client's system requirements and dependencies at a 
particular point in time (usually) prior to any actual design or 
development work. It's a two-way insurance policy that 
assures that both the client and the organization understand 
the other's requirements from that perspective at a given 
point in time. The SRS document itself states in precise and 
explicit language those functions and capabilities a software 
system must provide, as well as states any required 
constraints by which the system must abide. The SRS also 
officiates as a blueprint for completing a project with as little 
cost growth as possible. The SRS is often referred to as the 
"parent" document because all subsequent project 
management documents, such as design specifications, 
statements of work, software architecture specifications, 
testing and validation plans, and documentation plans, are 
related to it. It is important to note that an SRS contains 
functional (requirements that define a function of a software 
system or its component) and nonfunctional requirements 
(requirements that specify criteria that can be used to judge 
the operation of a system, rather than specific behaviors) 
only; it doesn't offer design suggestions, possible solutions to 
technology or business issues, or any other information other 
than what the development team understands the customer's 
system requirements to be. There are some standard for SRS 
defined by IEEE e.g. IEEE STD 830-1998 [41] and IEEE 
STD 1233-1998 [42]. 

C-l. Goals of SRS 

A well-designed, well-written SRS accomplishes four 
major goals: 

• It provides feedback to the customer. An SRS is 
the customer's assurance that the development 
organization understands the issues or problems to 
be solved and the software behavior necessary to 
address those problems. Therefore, the SRS should 
be written in natural language, in an unambiguous 
manner that may also include charts, tables, data 
flow diagrams, decision tables, and so on. 



• It decomposes the problem into component parts. 
The simple act of writing down software 
requirements in a well-designed format organizes 
information, places borders around the problem, 
solidifies ideas, and helps break down the problem 
into its constituent parts in an orderly fashion. 

• It serves as an input to the design specification. As 
mentioned previously, the SRS serves as the parent 
document to subsequent documents, such as the 
software design specification and statement of 
work. Therefore, the SRS must contain sufficient 
detail in the functional system requirements so that 
a design solution can be devised. 

It serves as a product validation check. The SRS also acts 
as the parent document for testing and validation strategies 
that will be applied to the requirements for verification and 
validation. 

C-2. What should the SRS address? 

IEEE standards suggest that SRS should address the 
following basic issues: 

• Functionality: What is the software supposed to 
do? 

• External interfaces: How does the software interact 
with people, the system's hardware, other 
hardware, and other software? 

• Performance: What is the speed, availability, 
response time, recovery time of various software 
functions, etc.? 

• Attributes: What are the portability, correctness, 
maintainability, security, etc. considerations? 

• Design constraints imposed on an implementation: 
Are there any required standards in effect, 
implementation language, policies for database 
integrity, resource limits, operating environment(s) 
etc.? 

C-3. Characteristics of a good SRS 

A good SRS should contain the following 
characteristics: 

• Complete: SRS should define precisely all the go- 
live situations that will be encountered and the 
system's capability to successfully address them. It 
should contain everything that is needed by the 
software designers to develop the software. 

• Correct: Of course everyone associated with the 
intended system expects the specification to be 
correct. No one writes a specification that they 
know is incorrect. Someone may like to say it - 
"Correct and Ever Correcting." The discipline is 
keeping the specification up to date when someone 
finds things that are not correct. 

• Consistent: The SRS should be consistent within 
itself and consistent to its reference documents. If 
someone calls an input "Start and Stop" in one 
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place, he/she shouldn't call it "Start/Stop" in 
another. 

• Accurate: SRS should precisely define the system's 
capability in a real-world environment, as well as 
how it interfaces and interacts with it. This aspect 
of requirements is a noteworthy problem area for 
many SRSs. 

• Modifiable: The logical, hierarchical structure of 
the SRS should facilitate any necessary 
modifications; grouping related issues together and 
separating them from unrelated issues makes the 
SRS easier to modify. 

• Ranked: Individual requirements of an SRS should 
be hierarchically arranged according to stability, 
security, perceived ease/difficulty of 
implementation, or other parameter that helps in 
the design of that and subsequent documents. 

• Unambiguous: SRS must contain requirements 
statements that can be interpreted in one way only. 
This is another area that creates significant 
problems for SRS development because of the use 
of natural language. 

• Testable: SRS must be stated in such a manner that 
unambiguous assessment criteria (pass/fail or some 
quantitative measure) can be derived from the SRS 
itself. 

• Traceable: Requirements traceability (RT) is 
another major factor that determines how easy it is 
to read, navigate, query and change requirements 
documentation [6]. SRS must be able to link each 
software functional requirement back to its origin, 
possibly a use case or business rule. Teams should 
embrace traceability information that connects 
functional requirements to associated design 
elements, code segments and tests to accelerate 
debugging and software maintenance. As RT lies at 
the heart of requirements management practice, 
providing RT in SRS is a means of achieving 
integrity and completeness of requirement 
documentation that has an important role to play in 
managing changes. 

• Valid: A valid SRS is one in which all parties and 
project participants can understand, analyze, 
accept, or approve it. This is one of the main 
reasons SRSs are written using natural language. 

• Verifiable: A verifiable SRS is consistent from one 
level of abstraction to another. Most attributes of a 
specification are subjective and a conclusive 
assessment of quality requires a technical review by 
domain experts. Using indicators of strength and 
weakness provide some evidence whether preferred 
attributes are present or not. 



C-4. Benefits of a good SRS 

The IEEE 830 standard defines the benefits of a good 
SRS [41]: 

• Establish the basis for agreement between the 
customers and the suppliers on what the software 
product is to do: The complete description of the 
functions to be performed by the software specified 
in the SRS will assist the potential users to 
determine if the software specified meets their 
needs or how the software must be modified to 
meet their needs. 

• Reduce the development effort: The preparation of 
the SRS forces the various concerned groups in the 
customer's organization to consider rigorously all 
of the requirements before design begins and 
reduces later redesign, recoding, and retesting. 
Careful review of the requirements in the SRS can 
reveal omissions, misunderstandings, and 
inconsistencies early in the development cycle 
when these problems are easier to correct. 

• Provide a basis for estimating costs and schedules: 
The description of the system to be developed as 
given in the SRS is a realistic basis for estimating 
project costs and can be used to obtain approval for 
bids or price estimates. 

• Provide a baseline for validation and verification: 
Teams can develop their validation and 
Verification plans much more productively from a 
good SRS. As a part of the development contract, 
the SRS provides a baseline against which 
compliance can be measured. 

• Facilitate transfer: SRS makes it easier to transfer 
the software product to new users or new 
machines. Customers thus find it easier to transfer 
the software to other parts of their organization, 
and suppliers find it easier to transfer it to new 
customers. 

• Serve as a basis for enhancement: Because the SRS 
discusses the system but not the project that 
developed it, the SRS serves as a basis for later 
enhancement of the developed system. The SRS 
may need to be altered, but it does provide a 
foundation for continued system evaluation. 

D. Agreeing and Validating Requirements 

It is an arduous job to maintain agreement with all 
stakeholders as they have divergent goals. Validation is the 
process of establishing that the requirements elicited and 
specified provide an accurate account of stakeholder 
requirements. This activity is essential for resolving 
conflicts between stakeholders. 

Techniques such as inspection and formal analysis tend 
to concentrate on the consistency and completeness of the 
requirements. This approach is illustrated by SCR [43] 
which provides an automated checking of the formal model 



259 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



(IJCSIS) International Journal of Computer Science and Information Security, 

Vol 8, No. 8, November 2010 



for syntactic consistency and completeness. On the other 
hand, techniques like prototyping, specification animation, 
and the use of scenarios are geared towards testing a 
correspondence with the real world problem: have all the 
aspects of the problem that the stakeholders regard as 
important been covered? 

In some cases, the methods and tools used by the 
requirement engineers dominate the way that they see and 
describe problems, which, in the extreme case, shifts the 
problem of validating requirements statements to a problem 
of convincing stakeholders that the chosen representation 
for requirements models is appropriate. This leads to the 
realization that observation is not value free, rather it is 
theory-driven, and is biased by the current paradigm. 
Jackson captures this perspective through his identification 
of problem frames [44]. If stakeholders do not agree with 
the choice of problem frame, it is unlikely that they will 
ever agree with any statement of the requirements. 
Ethnomethodologists attempt to avoid the problem 
altogether, by refusing to impose modeling constructs on the 
stakeholders [33]. By discarding traditional problem 
analysis tools, they seek to apply value-free observations of 
stakeholder activities, and therefore circumvent the 
requirements validation issue altogether. 

Requirements negotiation attempts to resolve conflicts 
on goal among stakeholders without necessarily weakening 
satisfaction of each stakeholder. Early approaches to 
requirements negotiation focused on modeling each 
stakeholder's contribution separately rather than trying to fit 
their contributions into a single consistent model [45] and 
on the importance of establishing common ground [46]. 
Boehm introduced the win-win approach [47] in which the 
win conditions for each stakeholder are identified, and the 
software process is managed and measured to ensure that all 
the win conditions are satisfied, through negotiation among 
the stakeholders. These negotiation models concentrate on 
the identification of the most important goals of each 
participant, and ensure that these goals are met. This 
approach is also used in other RE techniques for promoting 
agreement, without necessarily making the goals explicit 
[48]. For example, in Quality Function Deployment (QFD) 
[49], matrices are constructed to compare functional 
requirements with one another and rate their importance 
rather explicitly identifying stakeholder goals. 

After agreeing of the requirements, teams should begin 
"testing" as soon as they have requirements in hand. 
Deriving test cases from use cases and scenarios is a valuable 
way to find problems in the use cases themselves, in 
functional requirements derived from the use cases and in 
analysis models created from the requirements [18]. 

E. Requirement Management 

Managing change is a fundamental activity in RE [50] 
as every successful software system evolves due to change 
in the environment it is operating as well as change in the 



requirement of stakeholders. Requirement Management 
should go through the following steps: 

• Manage Requirements Versions: As requirements 
evolve during the course of a project, it is 
important to track the different versions of 
requirements specification documents and even 
individual requirements [18]. Changes to 
requirements documentation need to be managed 
properly and minimally, this involves providing 
techniques and tools for configuration management 
and version control [51], and by exploiting 
traceability, the impact of changes in different parts 
of the documentation can be monitored and 
controlled correctly. Version tracking helps to 
ensure that all team members are working from the 
latest requirements baseline. 

• Establishing A Change Control Process: Typically, 
changes to requirement specifications include 
adding or deleting requirements, and also fixing 
errors. Requirements are added in response to 
changing stakeholder needs that were missed in the 
initial analysis. Requirements are deleted usually in 
the circumstances to forestall cost and schedule 
overruns. Again there also remain some 
inconsistencies in requirements which arise both as 
a result of mistakes and because of conflicts 
between requirements. In any case, managing 
inconsistency in requirements specifications as 
they evolve is a major challenge [52]. So once 
requirements have been base lined, proposed 
modifications in them should follow an established 
change control process which provides consistency 
in the way requirement changes are proposed, 
evaluated, approved or rejected, communicated to 
stakeholders and implemented in affected work 
products. Teams should have formal written 
change control processes in place before eliciting 
requirements [18]. 

• Perform Requirements Change Impact Analysis: 
Requirement Management not only includes 
process of managing documentation but also 
process of recognizing change through continued 
requirements elicitation, re-evaluation of risk, and 
evaluation of systems in their operational 
environment. Thus, each proposed change needs to 
be evaluated in terms of existing requirements and 
architecture so that the trade-off between the cost 
and benefit of making a change can be assessed. 

Finally, the process of identifying core requirements in 
order to develop architectures that are (a) stable in the 
presence of change, and (b) flexible enough to be customized 
and adapted to changing requirements, is one of the key 
research issues in software engineering [53]. 



260 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



(IJCSIS) International Journal of Computer Science and Information Security, 

Vol 8, No. 8, November 2010 



VI. Requirement Engineering: Challenges and 
Suggestions 

As mentioned earlier, Requirement Engineering is a 
core process as well as most crucial part in software 
development life cycle. Bugs in requirements may not be 
identified during development phase rather they remain 
concealed until system becomes operational and customer 
requirements are not met. Poor requirements cause not only 
modifications in requirement specifications but also require 
re-designing, re-implementing and retesting for entire 
software. Therefore, requirement engineers have to struggle 
and conquer uncountable numbers of challenges for 
developing effective and efficient software. 

Anticipating requirement engineering challenges will 
grant opportunities for requirement engineers to enhance 
software success rate. There have been many investigations 
conducted to explore different challenges in various 
domains of requirement engineering. Some suggestions to 
overcome these challenges are discussed below: 

• Elicitation: Reduce Rework by Capturing Better 
Information 

The process of capturing requirements in 
context, requirements elicitation includes 
identifying key business and technical 
stakeholders, getting commitment to stakeholder 
involvement, selecting appropriate elicitation 
techniques and capturing requirements and 
scenarios in a simple to understand form. For 
reducing rework, teams should mature their 
existing requirements elicitation process by helping 
to define responsibilities and stakeholders, identify 
appropriate elicitation techniques and train team 
members to use the right techniques. They should 
also take initiative to capture user scenarios in a 
simple, visual form that is easy to understand by 
the user and accelerate elicitation with interactive 
simulations. Enhancement in communication and 
collaboration among distributed teams throughout 
the project lifecycle is also necessary for better 
requirement elicitation. Teams should also improve 
alignment between business expectations and 
project deliverables, which increases end user 
satisfaction and reduces rework from incorrect or 
incomplete requirements. 

• Analysis: Reduce Time to Market with More 
Effective Collaboration 

Requirements analysis involves verifying, 
estimating and prioritizing newly captured 
requirements for remaining application lifecycle 
steps. To reduce time to market, teams should 
codify their existing requirements analysis process 
by implementing an effective approach to evaluate 
and prioritize requirements for specification, 
design, construction and testing. The process is 
improved through the deployment of tools and 



techniques that swell efficiency and accuracy. By 
following these steps, teams can gain better 
estimation and thus, superior predictability for 
software delivery. 

• Specification: Improve Quality through More 
Effective Communications 

Requirements specification includes adding 
detail to requirements incrementally to the optimal 
level for validation, design, coding, testing and 
documentation. To ameliorate quality, teams 
should formulate and automate their existing 
requirements specification process by defining a 
standard hierarchy of requirement types and 
developing standard templates to ensure 
completeness. They should also identify various 
specification techniques and apply technology for 
capturing requirements in a meaningful, easy-to- 
understand way. Traceability across the lifecycle 
allows teams for better prediction of the impact of 
change and proactively communicates those 
changes to all team members affected. Through 
these steps, software defects can be reduced nicely 
because in this case development teams have a 
better understanding of requirements. 

• Agreeing and Validation: Improve Accuracy and 
Completeness of Requirements 

Requirements validation involves verifying 
whether the specification is inclusive and clear 
enough for the development team to understand 
exactly what it needed. It also includes validating 
whether the requirement of the key stakeholders 
are consistent with the original need and intent of 
the business. To improve accuracy, teams should 
mature their existing process by automating 
validation and verification to drive adoption and 
enforcement and improving consistency and 
quality through interactive simulations and 
storyboarding of visual scenarios. These steps trim 
down software defects, increase satisfaction and 
alignment with business stakeholders and thus 
enhance business value. 

• Management: Reduce Costs through Improved 
Change Management 

The process of gathering and managing change 
requests during the application lifecycle, 
requirements management also includes selecting 
changes to be incorporated within a particular 
release and ensuring effective implementation of 
changes with no adverse impact on schedule, scope 
or quality. To reduce costs, teams should contrive 
their existing requirements management process by 
establishing processes for defining and maintaining 
requirements baselines and defining a standard 
process for requesting changes. They may also 
establish a systematic approach to evaluate and 
approve change requests so that scope changes and 
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affected commitments are managed. By improving 
ongoing change management, maximizing business 
impact, while minimizing schedule and scope 
impact, satisfaction of business stakeholders can be 
increased. 

VII. Conclusion 

RE is often treated as a time-consuming, bureaucratic and 
contractual process and due to ineffective RE; customers 
may not be satisfied with the developed system. This attitude 
is changing as RE is increasingly recognized as a critically 
important activity in the field of Software Engineering. The 
novelty of many software applications, the speed with which 
they need to be developed, and the degree to which they are 
expected to change, all play a role in determining how the 
systems development process should be conducted [6]. In 
future, RE will continue to evolve in order to deal with 
different development scenarios for further amelioration. In 
this paper we discussed RE in detail with challenges and 
some suggestions. We believe that effective RE will continue 
to play a key role to ensure success of a project in developing 
quality software. 
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Abstract — License Plate Recognition (LPR) systems play an 
important role in intelligent transportation applications. These 
systems have extensively been used in highway and bridge 
charge, port, airport gate monitoring, parking and toll 
applications, to mention a few. We propose here an automatic 
license plate detection and recognition system for GCC countries 
license plates containing Arabic letters and numerals. The system 
introduces a robust algorithm for the extraction of the license 
plate region using adaptive thresholding and morphological 
filtering. The recognition stage is based on extracting LDA 
(Linear Discriminant Analysis) features with a neural network 
classifier. Preliminary experiments on the system have been 
carried with real images of vehicles captured under various 
conditions. The proposed system is shown to achieve high 
recognition accuracy under different illumination conditions. 

Keywords-component; license palte recognition, LDA, Arabic 
character recognition; GCC countries 



I. 



INTRODUCTION 



In recent years, research on intelligent transportation 
systems (ITS) has gained a lot of attention. Such systems cover 
a multitude of technologies subdivided into intelligent 
infrastructure systems and intelligent vehicle systems [13]. As 
one of main forms of ITS technology, Automatic License Plate 
Recognition (ALPR) is an important technique that is used for 
the identification of vehicles. There are many applications that 
could benefit from such technology including entrance 
admission, security, parking control, airport or cargo control, 
road traffic control, speed control, toll gate automation, and so 
on. The two main components of any ALPR system are: plate 
localization & segmentation, and character segmentation & 
recognition. The inaccurate detection of the plate and 
characters leads to a useless recognition stage. On the other 
hand, character recognition is an essential and important step in 
any ALPR system, which influences significantly the overall 
accuracy and processing speed of the whole system [2, 14]. 
The problem is that most researchers either focus on plate 
detection or character recognition, but not on both. For 
localization and character segmentation, we usually need to use 
some edge detection algorithm so that the final output 
segmented characters are binary images. In this respect, we 
propose to use morphological filtering followed by a robust 



binarisation algorithm using a novel adaptive thresholding 
technique. For the recognition stage, most previous work 
concentrated on using artificial neural networks (ANN) [16]. 
ANNs can achieve promising performance if the quality of the 
given image is good. However, the quality of images taken in 
real applications is not always high. This is due to the operating 
conditions (e.g. dust) and distortion or degradation due to poor 
image acquisition environment/equipment. Experiments have 
shown that it is difficult to achieve high recognition rates only 
by feeding the data from the images into the neural network 
[3]. In this paper, we propose to preprocess the segmented 
character images using a Linear Discriminant Analysis (LDA) 
to transform such images into small dimension feature vectors, 
then use these vectors as inputs to the ANN classifier. 
Preliminary work on the system has been carried on real 
images of vehicles captured under various illumination 
conditions with excellent overall performance compared to 
existing systems dealing with Arabic characters. It is worth 
noting that there were very few attempts, to date, in developing 
robust ALPR systems that support Arabic, Latin characters, 
and numerals. 

II. Background 

Before presenting the proposed system, a brief discussion 
will be given on current ALPR systems for Arabic, and Latin 
characters, in addition to number-based car plates systems. A 
typical ALPR system is composed of the following steps (see 
Fig. 1): 



A. License Plate Region Extraction 

Such a step is very critical to the success of any ALPR system. 
Park et. al. [4] developed a model for extracting Korean 
license plates based on color while Kim [5] proposed a system 
to extract the plate based on the Hough transform method. 
Hontani et. al. [6] developed a method for extracting the plate 
without knowing its position and the image size. Their 
approach was based on a scale shape analysis. Ahmed et. al. [7] 
proposed a vertical edge detection algorithm followed by edge 
matching based on the size and B/W ratio of the plate. 
Numerous other techniques have also been proposed in the 
literature, see references [12] for a detailed survey of such 
techniques. 
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Fig. 1 : A Typical ALPR system 



B. License Plate Segmentation 

Before any classification can be carried out, there is a need 
to segment the license plate characters from the extracted plate. 
Fortunately for Arabic characters based LPR systems, like in 
GCC countries, all the characters are in their isolated form. 
Exception is the country name, which can be easily isolated 
and dealt with separately. This makes the segmentation stage 
simple as can be seen in the next section [16]. 

C. License Plate Recognition 

A number of statistical, syntactic, and neural approaches 
have been developed for the recognition stage. Cowell and 
Hussain [8] identified the characters based on the number of 
black pixel rows and columns of the character and comparison 
of the values to a set of templates or signatures in a database. 
Furthermore, the thinning of Arabic characters was also 
discussed in [9] to extract essential structural information of 
each character, which is then used for the classification stage. 
Template matching was proposed by Zidouri [7]. This 
approach involves the use of a database of character or 
templates, and for each possible input character there is a 
separate template. Correlation techniques are generally used to 
identify the difficult characters. 

A structural or syntactic approach to recognize characters in a 
text document was adopted by Hamami [10]. This technique 
yields better results when applied on individual characters. 
However, since this approach is based on the detection of holes 
and concavities in the four directions (up, down, left, and 
right), it may not be appropriate for Arabic characters with low 
resolution but may work for numerals. Additionally, secondary 
characteristics are used in order to differentiate between the 
characters. Numerous systems have used neural networks for 
recognition [11]. Such systems as usually very effective, 
however their accuracy depends heavily on the size of the 
training data [12,17]. 

III. The Proposed ALPR System 

The proposed system in Fig. 2 and 3 is intended for a 
parking application in the GCC countries. Our preliminary 
experiments were carried out for Saudi license plates for the 
moment with a plan for extending the coverage to all GCC 
countries and beyond. 
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Fig. 2: The Proposed ALPR system 
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Fig. 3: Flow chart of the proposed algorithm 

The system starts first by acquiring the image of the car 
containing the LP region (we used in our experiments frontal 
images). Sample images taken in our experiments are displayed 
in Fig. 4. Next, the license plate of the car is extracted by 
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applying a series of operations. The extracted LP region is 
further divided into individual characters. For recognition, the 
isolated characters are then transformed and fed into a pre- 
trained neural network. 

The sequence of recognized characters is then checked with a 
database. If the car is recognized, the system would then 
command the authorization for access or for billing or for 
sending the number plate to a central computer for further 
processing depending upon the application of interest. 

The candidate LP region extraction is the key step in an 
LPR system, which influences the accuracy of the system 
significantly. The goal of this phase is, given an input image, to 
produce a number of regions that have high probability of 
containing a license plate. In the adopted approach, the 
extraction of the candidate region from a set of probable 
regions is carried out in four steps. These are explained in the 
following subsections. 



(IJCSIS) International Journal of Computer Science and Information Security, 

Vol 8, No. 8, November 2010 
noticed from the collected data, that most car images exhibited 
more horizontal lines than vertical lines. 




Fig. 4: Sample images from the database 

A. License Plate Extraction 

The LP extraction is the main step in the overall system. 
The accuracy of this step affects the accuracy of the whole 
system significantly. This phase extracts the region of interest, 
i.e. the license plate from the acquired image. This step 
consists of three stages: 

1) Edge Detection 
First of all, the acquired image should be converted from a 
color RGB image into a gray level image (see Fig. 5). Different 
gradient-based edge detectors were implemented. In our 
experiments, the Canny edge detector was used. The Canny 
Edge Detector was shown to achieve low error rate, localized 
edge points, and a single point edge response (see Fig. 6). We 




Fig. 5: The original acquired image & its gray-scale image 




Fig. 6: Detected edges using Canny' s method and edge 
enhancement using a thickening mask 

Although the Canny' s Edge Detector proved to be the 
best technique for detecting the LP region, an additional 
mask was required to enhance the detected edges. The 
thickening mask is used for this purpose. The thickening 
operation is the dual of morphological thinning and helps in 
strengthening the edge structure of the image (see Fig. 6). 

2) Detecting the Closed Boundaries of the LP Candidate 
Regions 

By tracing the pixels in the edge detected image, the 
closed structures are considered as boundaries of candidate 
LP regions. Tracing is carried using a 8- neighborhood 
scheme. After tracing the connected pixels, a set of pre- 
defined conditions is then used on these boundaries for 
identifying the closed contours that are LP mostly likely 
candidates. The conditions we use for this purpose are: 

• Length and Shape Filtering: This step is based on a pixel 
counting approach. In our specific application, the length 
of the LP usually ranges between 1000 and 2400 pixels 
(from training). Our approach is to trace all boundaries 
within the given image, and select the boundaries 
containing between 1000 and 2400 pixels. 

• Width/Length Ratio: The standard ratio of width to height 
in normal Saudi license plates is 2:1. To take into 
considerations small angle deviations, we used a ratio 
range of 1.3 to 2.7. As for long license plates, the ratio is 
around five (practically, between 3 and 5.5). If the area 
under study satisfies the above conditions, it is then seen 
as a good LP candidate. 

• Black to White Ratio: Normal Saudi license plates are 
white with black characters. It is found that the black to 
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white ratio is around 20-25% while the mean is between 
150 and 170 gray levels. Only regions satisfying this 
condition are considered as potential LP candidates. 
The above conditions were found to lead to a single LP 
region for most images from the database, (see Fig. 7). 




Fig. 7: Final results from testing the different LP candidate 
regions 

3) Filtering 

Once the candidate LP region is chosen, some standard 
filtering tools are used to remove unwanted noise (Impulsive 
and Gaussian) and enhance the selected LP region. 



B. License Plate Segmentation 

After localizing the LP region, a robust segmentation is 
required in order to extract the 6 characters. Before extracting 
the characters, we need first to identify and remove the word 
"Saudia", then carry vertical and horizontal projections. 

1) Removing the Word "Saudia " ( ^J* — ^) 

The approach for filtering out the word "Saudia" from the 
LP region depends upon the type of license plate under 
analysis. 

Case 1 : Long LP: For this case, we remove the word area from 
the middle region on the x-axis and from the middle region on 
the y-axis as shown in Fig. 8. This region is then filled and 
replaced with the average gray level of the plate. 

Case 2: Normal LP: For normal license plates, we delete the 
top of image containing the word "Saudia" as shown in Fig. 8. 




rWfc 



40% 



r6W 



Fig. 8: Filtering the word "Saudia" from the license plates 



2) Horizontal and Vertical Projections 

To obtain the separate characters, we use the horizontal and 
vertical projections approach. The horizontal projection is used 
to remove unwanted noise in the upper and lower regions that 
do not carry information about the characters as shown in Fig. 
9. While the vertical projection aims at separating the different 
characters as shown in Fig. 10. A threshold is applied on the 
vertical and horizontal projections to obtain the individual 
images of the characters which are usually not of identical size. 




Fig. 9: The horizontal projection of the license plate 




Fig. 10: The vertical projection of the license plate 
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C. Normalization and Character recognition 

There are numerous techniques in pattern recognition that 
have been applied successfully to the problem of character 
recognition. For Arabic characters, techniques such as 
statistical, syntactic, and neural networks have been in use 
[4,12]. Most errors in these systems come from the 
segmentation stage. This is because Arabic writing is 
connected even if it is printed or typewritten. Fortunately for 
our application, all the characters are in their isolated form 
and therefore easily segmented . 

Before discussing the details of the classification stage, we 
need to outline the preprocessing stage of the segmented 
characters. In particular, this first stage consists of a 
normalization step followed by a binarization step. 

1) Normalization and Binary Images 

Even though the heights of the extracted character images are 
all the same, the widths on the other hand may be slightly 
different (from the projection stage). For this reason, we 
normalize all the segmented imaged to 512x256 pixel images. 



♦ 



* r 




\ I 




Fig. 1 1 : Sample characters images 
after normalization and resizing 

To simplify the complexity of the classification stage, we 
introduce two additional steps. First, all the images were 
resized to 1/16 of their original size resulting in images with 
128x64 pixels (see Fig. 1 1). 

Furthermore, the images were complemented to make the 
characters appear as white pixels over a black background. 
The resulting images were then converted into binary images 
using a novel adaptive thresholding approach. A very high 
threshold is initially used and the number of white pixels over 
the total number of pixels is determined. The threshold is then 
lowered in steps until a jump in this ratio is observed. The 
optimal threshold is then chosen as the threshold used before 
the jump in the value of the ratio occurs. 

2) Feature Extraction and Classification using ANN 
Once the images are normalized and converted into binary 
images, the data is not ready for processing by the classifier. 
Previous work with Artificial Neural Networks in classification 
has shown excellent results when the quality of the images used 
for training is high and when the training database is large 
enough. However, the quality of images taken in real 
applications is not always high. This is due to the operating 
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conditions (e.g. dust) and distortion or degradation due to poor 
image acquisition environment/equipment [12]. Previous 
experiments have shown that it is difficult to achieve high 
recognition rates only by feeding the data from the images into 
the neural network [3]. In this paper, we propose to preprocess 
the segmented character images using a Linear Discriminant 
Analysis (LDA) to transform such images into small dimension 
feature vectors, then use these vectors as inputs to the ANN 
classifier. 

Linear discriminant analysis (LDA) is a classical dimension 
reduction method that aims at finding the optimal projection 
directions to maximize the ratio of the between-class scatter 
and the within-class scatter. After finding the projected 
directions, data can be mapped to a low-dimensional subspace. 
This lower dimension subspace can be seen as the subspace of 
feature patterns (see [18] for more details. Once the feature 
vectors for both the training database of characters and the test 
feature vectors are obtained, the classification stage using ANN 
is performed. Note that the size of the feature vectors is now 
very small (around 20x1) as compared to the size of the 
original images (128x64x1). 

3) Classification using ANN 

Artificial Neural Networks have gained a lot of popularity in 
different pattern recognition applications. An ANN is basically 
a computational model inspired from the way our biological 
neural system works. It uses neurons as its processing 
elements; working in parallel as a well as a unified network to 
solve learning problems. The interesting aspect of a neural 
network is that, like people, it learns by example and improves 
by training. There are different structures of ANNs. The most 
popular is the Multilayer Perceptron (MLP) model using the 
back-propagation (BP) algorithm for training. Such a network 
gained a lot popularity in classification tasks due to its 
flexibility, robustness, and computational efficiency. 
The general model of the MLP consists of a number of nodes 
arranged in multiple layers with connections between the nodes 
in the adjacent layers by weights. Each of the hidden layers 
consists of a number of neurons. Each neuron is connected to 
the input via a link with a certain weight. The output of the 
neuron is applied to a non linear function. 
If Vi(n) is the output of 1 th neuron at the n th iteration, Wji(n) is 
the synaptic weight connecting the output of the i th neuron to 
the j th neuron, then the local field Vj(n) induced at the input of 
the activation function associated with neuron j is given by: 



Vj(n) = ^Wji(n)yi(n), 



(1) 

where m is the total number of the inputs applied to the neuron 
j. Let's denote the nonlinear function applied to the output of a 
given neuron as <D(.), then the output at the j th neuron becomes: 
y J (n) = ^ J (v J (n)). 

The most popular nonlinear activation functions used are the 
"Tangent sigmoid" and the "Logarithmic sigmoid". 
Using the conventional BP algorithm, the synaptic weights 
Wji(n) are updated based on the following principle: 
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w Jt (n + 1) = WjXn) + T]Sj(n)yXn), 



(3) 

where, r| is the learning rate, and 5j(n) is called local gradient 
associated with the f 1 neuron. If the neuron j is an output 
neuron, then the local gradient used in the iterations is obtained 
from: 

S J (n) = e J (n)0 J (v J (n)X 

Where ej(n) is the error between the output of neuron yj(n) and 
its desired response dj(n). If the neuron j is a neuron from the 
hidden layer, then the local gradient according to the following 
expression: 



Sj (n) = ®j (vj (w))5% (n)w kj (n). 

k 



(5) 



Many neural network architectures have been proposed for 
use in LPR systems. The most popular neural networks are the 
multilayer feed forward neural network, where neurons are 
grouped between layers and connections between neurons and 
consecutive layers are permitted. In this project we have used 
the multilayer perception. 

In our application, the input characters are now represented by 
feature vector of dimension 20x1. Hence, the network 
receives the 20-element input vectors. After the training stage, 
the network is then used to identify the input character. 




Input 
layer 



Hidden 
layer 

Fig. 12: Structure of an MLP network 



Output 
layer 



The 27 elements of the output vector represent either a letter 
or a number. The Saudi LPs possible characters include 17 
Arabic alphabet letters and 10 Indian numbers (see Fig. 13). 



Fig. 13: Alphanumerals included in Saudi LPs (17 
letters & 10 numbers) 

The network is designed as a two-layer log-sigmoid/log- 

sigmoid network. The hidden (first) layer is chosen to have 20 

neurons. 

The different stages in training are: 

• The network is first trained on ideal vectors until it has a pre- 
defined low sum- squared error. 

• Then, the network is trained on various sets of ideal and 
noisy vectors. Noisy observations are used to maintain the 
network's ability to classify ideal input vectors. 

The network is again tested on just ideal vectors. This 
ensures that the network responds perfectly when presented 
with ideal characters. 

IV. Experimental Results 

We now discuss the results of the proposed ALPR system. 
The discussion is based on the results obtained from the 
previous stages. The first stage discusses the extraction 
technique used in the LPR system. Then, the second stage deals 
with the segmentation part. Next, the results of the neural 
network recognition technique are illustrated. Finally, the 
whole system performance is described. 

For the sake of comparison, we also implemented the technique 
discussed in [13] using the AdaBoost algorithm. Both systems 
were trained with 500 car images and 300 other images were 
used for testing. We focused on the accuracy of the three 
stages discussed above, in particular, LP correct detection, 
Correct character segmentation, and Character recognition 
accuracy. The results are summarized in Table 1. 



Table 1: THE RESULTS OF THE EXTRACTION ALGORITHM 



Algorithm 



Tested 
Plates 



Successful Accuracy in 



LP 

extraction 



character 
segmentationlRate 



Character Overall 
Recognition Recognition 



Rate 



Algorithm 
from [14] 



300 



278 
(92.7%) 



266 
(95.6%) 



263 
(98.8%) 



263 
(87.6%) 



Our 
Algorithm 



300 



291 
(97%) 



287 
(98.6) 



285 
(99.3%) 



285 
(95%) 
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The table shows clearly that the proposed system achieves 
excellent results at all stages of the process. The overall 
accuracy is defined as the product of all intermediate 
accuracies. The performance of the proposed system 
outperforms by far the results obtained in [13]. 
We also tested the accuracy of the system when additional 
noise was considered on the plates. The results have been 
excellent down to an SNR of 5 dB. The system is also resilient 
to moderate tilt in the license plate. 

V. Conclusion 

License plate recognition systems exist and are 
commercially available for most European and Asian car 
plates. However, there were very few attempts made in 
developing systems for Arabic characters. In this paper, we 
propose an ALPR system of low cost and computational 
complexity for Saudi Arabian license plates, and easily 
extendable to other GCC car plates. Beside the use of Arabic 
language, Saudi Arabian license plates have several unique 
features that are taken into account in the segmentation and 
recognition phases. The system was tested over a large number 
of car images taken under different conditions. The overall 
accuracy obtained was above 95%. Two major improvements 
were made to existing systems. First a more robust 
preprocessing stage was introduced which includes an adaptive 
thresholding approach for binarization of the images. Second, 
the NN classification stage was enhanced and simplified by 
working with LDA features extracted from the normalized 
character images. 
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Abstract: A new optimization algorithm is 
developed in this paper as a Combined 
Algorithm of particle swarm optimization, is 
presented, based on a novel philosophy by 
modifying the velocity update equation. This 
is done by combined two different PSO 
algorithms i.e., Standard Particle Swarm 
Optimization and Personal Best Position 
Particle Swarm Optimization. Its 
performance is compared with the standard 
PSO (SPSO) by testing it on a set 15 of 
scalable and 13 non-scalable test problems. 
Based on the numerical and graphical 
analyses of results it is shown that the CAPSO 
(Combined Algorithm of Particle Swarm 
Optimization) outperforms the SPSO 
(Standard Particle Swarm Optimization), in 
terms of efficiency, reliability, accuracy and 
stability. 

Keywords: Particle Swarm Optimization, 
CAPSO (Combined Algorithm of Particle 
Swarm Optimization), global optimization, 
velocity update equation, Personal Best 
Position Particle Swarm Optimization. 

I. INTRODUCTION 

Standard Particle Swarm Optimization (SPSO): 

Particle swarm optimization (PSO) [1], [2] is a 
stochastic, population-based search method, 
modeled after the behavior of bird flocks. A PSO 
algorithm maintains a swarm of individuals 
(called particles), where each individual (particle) 
represents a candidate solution. Particles follow a 
very simple behavior: emulate the success of 
neighboring particles, and own successes 
achieved. The position of a particle is therefore 
influenced by the best particle in a neighborhood, 
as well as the best solution found by the particle. 

Particle position x / are adjusted using 



x i (t + l)=x i (t) + v i (t + l) 



...(1) 



where the velocity component, v. (t) represents 
the step size. For the basic PSO. 

where w is the inertia weight [11], c x andc 1 are 
the acceleration coefficients, ri .,r 2 .~U(0,l)> ># * s 
the personal best position of particle / , and y 
is the neighborhood best position of particle / . 
The neighborhood best position y . , of particle / 
depends on the neighborhood topology used [3], 
[4]. If a star topology is used, then j). refers to 

the best position found by the entire swarm. That 
is, 

y t ~ {y (0, y 1 (0, •■■•, y s (0) =min(/(^ (0), f(y 1 (t)), ...., f(y s (0) 
where S is the swarm size. The resulting 
algorithm is referred to as the global best PSO. 
For the ring topology, the swarm is divided into 
overlapping neighborhoods of particles. In this 
case, y. is the best position found by the 

neighborhood of particle / . The resulting 
algorithm is referred to as the Local best PSO. 

The Von Neumann topology defines 
neighborhoods by organizing particles in a lattice 
structure. A number of empirical studies have 
shown that the Von Neumann topology 
outperforms other neighborhood topologies [4], 
[5]. It is important to note that neighborhoods are 
determined using particle indices, and are not 
based on any spatial information. 

A large number of PSO variations have been 
developed, mainly to improve the accuracy of 
solutions, diversity, and convergence behavior 
[6], [7]. This section reviews those variations 
used in this study, from which concepts have 
been borrowed to develop a new, parameter-free 
PSO algorithm. 
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Van den Bergh and Engelbrecht [8], [9], and 
Clerc and Kennedy [3], formally proved that 
each particle converges to a weighted average of 
its personal best and neighborhood best positions. 
That is, 






(3) 



This theoretically derived behavior provides 
support for the barebones PSO developed by 
Kennedy [10], where the velocity vector is 
replaced with a vector of random numbers 
sampled from a Gaussian distribution with the 

mean defined by equation (3), assuming that c x = 
c 2 , and deviation, 

ru y ij\ 
The velocity equation changes to 



v v (* + 1) ~ N( 



y, 



y tJ 



,<*) 



The position update then changes to 
x.(t + 1) = v,.(f + 1) 
Kennedy [9] also proposed an alternative version 
of the barebones PSO, where 



v„ (/ + !) = 



N( y,J+y,J ,a) 



if U(0,\)< 0.5 



otherwise 



..(4) 



Based on equation (4), there is a 50% chance that 
the j-th dimension of the particle dimension 
changes to the corresponding personal best 
position. This version of the barebones PSO 
biases towards exploiting personal best positions. 

Silva et al. (2002) presented a predator- 
pray model to maintain population diversity. 



OR 



v iJ (t + l) = 2wv..(t) + 2c l r lJ (y..-x..) 



+ c 2 r 2j (y j -2x jj ) 



(5) 



In the velocity update equation of this new PSO 
the first term represents the current velocity of 
the particle and can be thought of as a 
momentum term. The second term is 



iij(yij- x ip 



IS 



proportional to the vector ic x 

responsible for the attractor of particle's current 
position and positive direction of its own best 
position (pbest). The third term is proportional to 

the vector c 2 r 2J (y. -2*. ) , is responsible for the 
attractor of particle's current position. 

Figure: -I: Comparative movement of a particle in 
SPSO and CPSO 




The pseudo code of CAPSO is shown below: 
ALGORITHM- CAPSO 



II. THE PROPOSED COMBINED 
ALGORITHM OF PSO 

The motivation behind introducing CAPSO is 
that in the velocity update equation instead of 
comparing the combined two difference PSO 
algorithm update velocity equation i.e., SPSO 
(Standard Particle Swarm Optimization) and 
PBPPSO (Personal Best Position Particle Swarm 
Optimization). 

Thus, we introduce a new velocity update 
equation as fellows: 

v ij ( t + ^ = wv ij O + c l r lj by ~ x ij ) + c 2 r 2j ^j - x ij) + 
wv ij O + c \\j ^ij ~ x ij ) + c 2 r 2j (-Xjj ) 



For t = 1 to the max: bound of the number on 

iterations, 

For i = \ to the swarm size, 

For j ■ = i to the problem dimensionality, 

Apply the velocity update equation (5); 
Update Position using equation (2); 
End- for- j 

Compute fitness of updated position; 

If needed, update historical information for 

personal best position and global best position; 

End-for-i; 

Terminate if global best position meets problems 

requirements; 

End- for- 1 ; 

END ALGORITHM 
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III. THE TEST BED 

Many times it is found that the evaluation of a 
proposed algorithm is evaluated only on a few 
benchmark problems. However, in this paper we 
consider a test bed of thirty benchmark problems 
with varying difficulty levels and problem size. 
The relative performance of SPSO and CAPSO 
is evaluated on two kinds of problem sets. 
Problem Set 1 consists of 15 scalable problems, 
i.e., those problems in which the dimension of 
the problems can be increased / decreased at will. 



In general, the complexity of the problem 
increases as the problem size is increased. 
Problem Set 2 consists of those problems in 
which the problem size is fixed, but the problems 
have many local as well as global optima. The 
Problem Set 1 is shown in Table 1 and Problem 
Set 2 is shown in Table 2. 



Table-1: Details of Problem Set-I (Continued) 



Serial 

No 


Function Name 


Expression 


Search Space 


Objective Function 
Value 


1. 


Ackley 


Minf(x) = - 20exp(-0.02 In -1 X xf ) 

-1 n 
- exp(« E cos(;nc.)) + 20 + e 
i=\ l 


-30 < x i < 30 





2. 


Cosine Mixture 


n n 
Minf(x) = -0.1 I cos(5ttx.) + E x z 
i=\ l i=\ l 


- 1 < x . < 1 


-O.lx in) 


3. 


Exponential 


n i 
Minf(x) = (-0.5 E x z ) 
i=\ l 


-1 < x t < 1 


-1 


4. 


Griewank 


Mmf(x) = l + ^^xf-n C os(^) 


-600 < x t < 600 





5. 


Rastrigin 


n i 
Minf(x) = \0n+ E [x z -10cos(2;rx.)] 
/=1 ' ' 


-5.12 < x t < 5.12 





6. 


Function '6' 


Minf(x)= E [100(x. +1 -x 2 ) 2 + (x.-l) 2 ] 
/=1 


-3 < jc. < 30 





7. 


Zakharov's 


n n . 2 n i 4 
Mmf(x) = E xf + [ E (-)* ] + [ E (-)* ] 


-5.12 < jc, < 5.12 





8. 


Sphere 


n 

Minf(x) = E x z 

i=l l 


-5.12 < jc- < 5.12 





9. 


Axis parallel hyper 
ellipsoid 


M in f (x) = ^ ix f 


-5.12 < jc- < 5.12 





10. 


Schwefel'3' 


Min f{x) = 2 k,|+ 11 \ x '\ 


-10 < jc- < 10 





11. 


Dejong 


Min f(x) = ^ (x, 4 + rand(0,\)) 


-10 < x, < 10 





12. 


Schwefel '4' 


Minf(x) = Max{ 


x i 


,!</<«} 


-100 < jc- < 100 





13. 


Cigar 


M in f (x) = xf + 100000 J xf 


-10 < jc- < 10 





14. 


Brown '3' 


Min f(x) = X [ixf)ixf +1 + 1) + ix? +1 + \)ix? + 1)] 
i=i 


- 1 < jc . < 4 





15. 


Function '15' 


Min fix) = Y, [0.2 xf + . 1 xf sin 2 x.] 


-10 < jc- < 10 






Table 2: Details of Problem Set-II 



Serial 

No 


Function Name 


Expression 


Search Space 


Objective 
Function Value 


1. 


Becker and Lago 


Min fix) =(|*i|" 5) 2 + (|* 2 |- 5) 2 


- 1 < x . < 1 





2. 


Bohachevsky ' 1 ' 


Min fix) = xf + 2xf- 0.3 cos(3^)- 0.4 cos(4;rx 2 ) + 0.7 


-50 < x l ,x 2 < 50 
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3. 


Bohachevsky '2' 


Minf(x) =jcf +2*2 -0.3cos(37rx 1 )cos(47rx 2 ) + 0.3 


-50 < x x , x 2 < 50 





4. 


Branin 


Mn f(x) = a(x 2 - bx\ + cx 1 - df + g(\ - h) cos(^ ) + g 

a = 1, b = — ^, c = — , d = 6, g = 10, h = 

\K 7t %7Z 


-5 < Xj < 100 
-5 < x 2 < 15 


0.398 


5. 


Eggcrate 


2 2 2 2 
Minf(x)=x^ + %2 + 25 (sin Xi + sin Xj ) 


-2 n < x i < 2 n 





6. 


Miele and Cantrell 


4 6 4 8 
Mnf{x) =(exp(xi ) - x , ) + 100(x 2 - x? ) + (tan(x, - x a )) + x. 


- 1 < x t . < 1 





7. 


Modified 
Rosenbrock 


2 2 2 2 
Minf(x) =100(x 2 - Xj ) + [6.4(x 2 - 0.5) - Xj - 0.6] 


-5 < Xj , x 2 < 5 





8. 


Easom 


2 2 
Minf(x) =- cos(xi ) cos(x2 ) exp(-(xi - tt) - {xj -ft) ) 


-10 < x. < 10 


-1 


9. 


Periodic 


2 2 2 2 
Minf(x) =- 1 + sin Xi + sin x^ - 0.1 exp(-Xi - x^ ) 


-10 < x. < 10 


0.9 


10. 


Powell's 


2 2 4 4 
Mnf{x) =(xi + 10x 2 ) + 5(x^ - x, ) + (x 2 - 2x^ ) + 10(xi - x, ) 


-10 < x, < 10 





11. 


Camel back-3 


M in f (x) = 2x\ + 1 .0 5Xj 4 + — x* + x x x 2 + x\ 
6 


-5 < Xj , x 2 < 5 





12. 


Camel back-6 


2 4 16 2 4 
Minf(x) =4xi + 2.1xi + t x i +^1^9 _ ^ x 2 + ^ x 2 


-5 < Xi , x 2 < 5 


-1.0316 


13. 


Aluffi-Pentini's 


4 4 2 2 
Minf(x) =0.25x 1 - 0.5xj - 0.5xj + O.lxj + 0.5x 2 


-10 < Xj < 10 


-0.3523 



IV. ANALYSES OF RESULTS 

The SPSO and the CAPSO are coded in C++ and 
implemented on Pentium-IV 2.4 GHz machine 
with 512 MB RAM under WINXP platform. 
Thirty independent runs with different seed for 
the generation of random numbers are taken. 
However, the same seed is used for generating 
the initial swarm for SPSO and CAPSO for the 
t run, where i = 1,2,. ..,50 . 

A run is said to be a successful run if 
the best objective function value found in that 
run lies within 1% accuracy of the best known 
objective function value of the problem. 

The maximum number of function 
evaluations is fixed to be 30,000. The swarm size 
is fixed to 20 and dim is 30. The inertia weight is 
0.7 and the acceleration coefficients for SPSO 



and CAPSO are set to be 



= 1.5- 



1 2 

A number of criterions are used to 
evaluate the performance of SPSO and CAPSO. 

The percentage of success is used to evaluate the 
reliability. The average number of function 
evaluations of successful runs and the average 



computational time of the successful runs, are 
used to evaluate the cost. For problem SET-I, by 
fixing for problem measured by the minimum, 
mean, success of rate and standard deviation of 
the objective function values out of fifty runs. 
This is shown in Table 3. The corresponding 
information for problem SET-II is shown Table 
4, respectively. 

In observing Table 3, it can be seen that 
CAPSO gives a better quality of solutions as 
compared to SPSO. Thus, for the scalable 
problems CAPSO outperforms SPSO with 
respect to efficiency, reliability, cost and 
robustness. 

In observing Table 4, it can be seen that 
CAPSO gives a better quality of solutions as 
compared to SPSO. Thus, for the non-scalable 
problems CAPSO outperforms SPSO with 
respect to efficiency, reliability, cost and 
robustness. 

In Table 3, It is observed that SPSO 
could not solve two problems with 100% success, 
whereas CAPSO solved all the problems with 
100% success. 
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Table-3 Comparative Objective function value obtained in 50 runs by SPSO and CAPSO for 

problem Set-I 



Problem 

No. 


Minimum Function 
Value 


Mean Function Value 


Standard Deviation 


Rate of Success 




SPSO 


CAPSO 


SPSO 


CAPSO 


SPSO 


CAPSO 


SPSO 


CAPSO 


1 


0.667619 


0.271435 


16485.6000 


2331.0000 


0.142795 


0.139467 


98.00% 


100% 


2 


0.644392 


0.279915 


1708.20000 


193.20000 


0.053545 


0.129501 


100% 


100% 


3 


0.000000 


0.000000 


60.000000 


60.000000 


0.000207 


0.000080 


100% 


100% 


4 


0.777974 


0.422415 


14364.6000 


5998.8000 


0.026005 


0.117862 


100% 


100% 


5 


27.127816 


0.266949 


30000.0000 


19504.800 


29.809592 


41.94101 


0.00% 


100% 


6 


0.000061 


0.000001 


166.200000 


141.00000 


0.200616 


0.285178 


100% 


100% 


7 


0.000274 


0.000003 


72.000000 


76.800000 


0.229660 


0.233736 


100% 


100% 


8 


0.685057 


0.295695 


6096.00000 


864.00000 


0.054336 


0.155304 


100% 


100% 


9 


0.000002 


0.000001 


60.600000 


63.600000 


0.179978 


0.219324 


100% 


100% 


10 


0.001109 


0.001107 


60.600000 


60.600000 


0.161759 


0.180857 


100% 


100% 


11 


0.601870 


0.098892 


11341.8000 


5126.4000 


0.067786 


0.227513 


100% 


100% 


12 


0.022248 


0.006012 


78.000000 


87.000000 


0.243564 


0.253503 


100% 


100% 


13 


0.001848 


0.001248 


1767.00000 


1767.0000 


0.253535 


0.273535 


100% 


100% 


14 


0.000126 


0.000108 


60.000000 


60.000000 


0.048579 


0.053570 


100% 


100% 


15 


0.000009 


0.000001 


60.000000 


60.000000 


0.005729 


0.004014 


100% 


100% 



Table-4 Comparative Objective function value obtained in 50 runs by SPSO and CAPSO for 

problem Set-II 



Problem 

No. 


Minimum Function 
Value 


Mean Function Value 


Standard Deviation 


Success of Rate 




SPSO 


CAPSO 


SPSO 


CAPSO 


SPSO 


CAPSO 


SPSO 


CAPSO 


1 


0.500000 


0.500000 


60.000000 


60.000000 


0.042453 


0.042452 


100% 


100% 


2 


0.017193 


0.002786 


64.200000 


62.400000 


0.258362 


0.248258 


100% 


100% 


3 


0.001029 


0.001024 


66.600000 


67.800000 


0.224219 


0.257928 


100% 


100% 


4 


0.398600 


0.395682 


128.400000 


175.200000 


0.137710 


0.148115 


100% 


100% 


5 


0.018613 


0.002431 


72.000000 


72.600000 


0.240972 


0.221812 


100% 


100% 


6 


0.498600 


0.398600 


128.400000 


120.400000 


0.167710 


0.147710 


100% 


100% 


7 


0.027193 


0.017786 


64.200000 


62.400000 


0.358362 


0.288258 


100% 


100% 


8 


0.015341 


0.012461 


82.200000 


95.400000 


0.281294 


0.256433 


100% 


100% 


9 


0.480507 


0.480489 


60.000000 


60.000000 


0.026709 


0.021144 


100% 


100% 


10 


0.067997 


0.051277 


840.600000 


517.200000 


0.215576 


0.253873 


100% 


100% 


11 


0.003378 


0.002978 


60.600000 


64.600000 


0.207517 


0.246517 


100% 


100% 


12 


0.005549 


0.003824 


63.600000 


66.600000 


0.270722 


0.238520 


100% 


100% 


13 


0.002655 


0.002017 


65.400000 


60.000000 


0.229666 


0.181436 


100% 


100% 



ujLla 



1 2 3 4 5 6 7 8 9 101112131415 

Scalable Problems 

Figure A: Comparing the SPSO and CAPSO with 
the help of Scalable 15 Problems SET-I. 



1 2 3 4 5 6 7 8 910111213 

Non-Scalable Problems 

Figure B : Comparing the SPSO and CAPSO with 
the help of Non-Scalable 13 Problems SET-II. 
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V. CONCLUSIONS 

This paper presented new population based algorithm 
CPSO (Combined Algorithm of Particle Swarm 
Optimization). It is represent based on a combined 
two different particle swarm optimization algorithms 
i.e., Standard Particle Swarm Optimization and 
Personal Best Position Particle Swarm Optimization. 
It is tested on 15 scalable problems and 13 non- 
scalable problems. It is shown that the new CAPSO 
(Combined Algorithm of Particle Swarm 
Optimization) outperforms SPSO (Standard Particle 
Swarm Optimization) in terms of efficiency, accuracy, 
reliability and robustness. Particularly for large size 
problems CAPSO outperforms SPSO. In this paper 
the effect of change of parameters in CAPSO is not 
explored. In a future study parameters fine tuning 
may be carried out for better performance. Also the 
application of CAPSO to the real word problems 
would be interesting as a future research. 

REFERENCES 

[1] R.C. Eberhart and J. Kennedy. A New 

Optimizer using Particle Swarm Theory. In 
Proceedings of the Sixth International 
Symposium on Micromachine and Human 
Science, pages 39^13, 1995. 

[2] J. Kennedy and R.C. Eberhart. Particle 

Swarm Optimization. In Proceedings of the 
IEEE International Joint Conference on 
Neural Networks, pages 1942-1948. IEEE 
Press, 1995. 

[3] J. Kennedy. Small Worlds and Mega-Minds: 

Effects of Neighborhood Topology on 
Particle Swarm Performance. In Proceedings 
of the IEEE Congress on Evolutionary 
Computation, volume 3, pages 1931-1938, 
July 1999. 

[4] J. Kennedy and R. Mendes. Population 

Structure and Particle Performance. In 
Proceedings of the IEEE Congress on 
Evolutionary Computation, pages 1671- 
1676. IEEE Press, 2002. 

[5] E.S. Peer, F. van den Bergh, and A.P. 

Engelbrecht. Using Neighborhoods with the 
Guaranteed Convergence PSO. In 
Proceedings of the IEEE Swarm Intelligence 
Symposium, pages 235-242. IEEE Press, 
2003. 

[6] A.P. Engelbrecht. Fundamentals of 

Computational Swarm Intelligence. Wiley & 
Sons, 2005. 

[7] J. Kennedy, R.C. Eberhart, and Y. Shi. 

Swarm Intelligence. Morgan Kaufmann, 
2001. 



[8] F. van den Bergh. An Analysis of Particle 

Swarm Optimizers. PhD thesis, Department 
of Computer Science, University of Pretoria, 
Pretoria, South Africa, 2002. 

[9] F. van den Bergh and A.P. Engelbrecht. A 

Study of Particle Swarm Optimization 
Particle Trajectories. Information Sciences, 
176(8):937-971,2006. 

[10] J. Kennedy. Bare Bones Particle Swarms. In 
Proceedings of the IEEE Swarm Intelligence 
Symposium, pages 80-87, April 2003. 

[11] Y. Shi and R.C. Eberhart. A Modified 
Particle Swarm Optimizer. In Proceedings of 
the IEEE Congress on Evolutionary 
Computation, pages 69-73, May 1998. 

[12] Angline, P.J.(1998a) 'Evolutionary 

optimization versus particle swarm 
optimization philosophy and performance 
differences', Lecture Notes in Computer 
Science, Vol. 1 447,pp.60 1-610, Springer, 
Berlin. 

[13] Angline, P.J(1998b) 'Using selection to 
improve particle swarm 

optimization', Proceedings of the IEEE 
Conference on Evolutionary Computations, 
pp.84-89. 

[14] Banks, A., Vincent, J. and Anyakoha, C. 

(2007) 'A review of particle swarm 
optimization,Part LBackground and 
development', Natural Computing: an 
International Journal, Vol. 6, No. 4, pp.467- 
484. 

[15] Banks, A., Vincent, J. and Anyakoha, C. 

(2008) 'A review of particle swarm 
optimization, Part II: Hybridisation, 
combinatorial, multicriteria and constrained 
optimization and indicative applications', 
Natural Computing: an International Journal, 
Vol. 7, No. I,pp.l09-124. 

[16] Baskar, S. and Suganthan, P.M. (2004) 'A 
novel concurrent particle swarm 
optimization', Proceedings of the Congress 
on Evolutionary Computations, pp. 792-796. 

[17] Deep, K. and Thakur, M. (2007) 'A new 
crossover operator for 
algorithms', Applied 
Computation, Vol. 188, 
Eberhart, R.C. and 
'Comparing inertia weights and constriction 
factors in particle swarm optimization', Proc. 
Congress on Evolutionary Computation, San 
Diego, CA, pp.84-88,. 

[18] Esquivel, S.C. and Coello Coello, CA. 
(2003) 'On the use of particle swarm 



real coded genetic 

Mathematics and 

No. 1, pp.895-911. 

Shi, Y. (2000) 



275 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



(IJCSIS) International Journal of Computer Science and Information Security, 
Vol. 8, No. 8, November 2010 



optimization with multi modal functions', [29] 

Proceedings of the Congress on 
Evolutionary Computations, pp.1 130-1 136. 

[19] He, S., Wu, Q.H., Wen, J.Y., Saunders, J.R. 
and Paton, R.C. (2004) 'A particle swarm 
optimizer with passive congregation', 
Biosystems, Vol. 78, pp. 135-147. [30] 

[20] Hendtlass, T. (2003) 'Preserving diversity in 
particle swarm optimization', Lecture Notes 
in Computer Science, Vol. 2718, pp.3 1^40. 
Higasbi, N. and Iba, H. (2003) 'Particle 
swarm optimization with Gaussian 
mutation', Proceedings of the 2003 IEEE 
Swarm Intelligence Symposium, pp. 72-79. [31] 

[21] Hu, X., Shi, Y. and Eberhart, R.C. (2004) 
'Recent advances in particle swarm', 
Proceedings of Congress Evolutionary 
Computation, Vol. 1, pp.90-97. 

[22] Janson, S. and Middendorf, M. (2005) 'A [32] 

hierarchical particle swarm optimizer and its 
adaptive variant', IEEE Transaction on 
System, Man and Cybernetics, Part B, Vol. 
38,pp.l272-1282. 

[23] Krink, T. and Lovbjerg, M. (2002) 'The 

lifecycle model: combining particle swarm [33] 

optimization, genetic algorithms and hill 
climbing', Proceedings of Parallel Problem 
solving from Nature, Vol. 7, pp. 621-630. 

[24] Krink, T., Vesterstrem, J.S. and Riget, J. 

(2002) 'Particle swarm optimization with [34] 

spatial particle extension', Proceedings of 
the Congress on Evolutionary Computation, 
pp. 1474-1479. 

[25] Liang, J.J., Qin, A.K., Suganthan, P.N. and 

Baskar, S. (2004) 'Particle swarm [35] 

optimization algorithms with novel learning 
strategies', Proceedings of the IEEE 
Conference on Systems, Man and 
Cybernetics, pp. 3659-3664. 

[26] Liu, H., Li, B., Wang, X., Ji, Y. and Tang, Y. [36] 

(2004) 'Survival density particle swarm 
optimization for neural network training', 
ISNN (1), LNCS 3173, pp.332-337, 
Springer-Verlag. 

[27] Lovbjerg, M. and Krink, T. (2002) [37] 

'Extending particle swarm optimizers with 
self-organized criticality', Proceedings of 
the Congress on Evolutionary Computation, 
pp. 1588-1593. 

[28] Lovbjerg, M., Rasmussen, T.K. and Krink, 
T. (2001) 'Hybrid particle swarm optimizer 
with breeding and subpopulation', 
Proceedings of the Third Genetic and 
Evolutionary Computation Conference, 
pp.469^176. 



Mohais A., Ward, C. and Posthoff, C. (2004) 
'Randomized directed neighborhood with 
edge migration in particle swarm 
optimization', Proceedings of the IEEE 
Conference on 

Evolutionary Computational, pp. 548-555. 
Pasupuleti, S. and Battiti, R. (2006) 'The 
gregarious particle swarm optimizer (G- 
PSO)', 

Proceedings of the 8th Annual Conference 
on Genetic and Evolutionary Computation 
(Seattle, Washington, USA). GECCO '06, 
pp.67-74, ACM, New York, NY. 
Peram, T., Veeramachaneni, K. and Mohan, 
C. K. (2003) 'Fitness-distance-ratio based 
particle swarm optimization', Proceedings 
of the IEEE Swarm Intelligence Symposium, 
pp. 174-181. 

Poli, R., Laugdon, W.B. and Holland, O. 
(2005) 'Extending particle swarm 
optimization via genetic programming', 
Proceedings of the Eighth European 
Conference on Genetic Programming, 
pp.291-300. 

Riget, J. and Vesterstrem, J.S. (2002) 'A 
diversity-guided particle swarm optimizer - 
the ARPSO', Technical Report 2002-02, 
EVALife, Department of Computer Science, 
University of Aarbus. 

Shi, Y. and Eberhart, R.C. (1998a) 'A 
modified particle swarm optimizer', 
Proceedings of the IEEE International 
Conference on Evolutionary Computation, 
pp.69-73. 

Shi, Y. and Eberhart, R.C. (1998b) 
'Parameter selection in particle swarm 
optimization,' 7 th Annual Conference on 
Evolutionary Programming, San Diego, 
USA. 

Y.Fukuyama, "Parctical Equipment Models 
for Fast Distribution Power Flow 
Considering Interconnection of Distributed 
Generators", Pro. Of IEEE PES Summer 
Meeting, 2001. 

F.F. Wu and A.F. Neyer, " Asynchronous 
Distributed State Estimation for Power 
Distribution Systems", Proc. of 10 th PSCC, 
Aug. 1990. 



276 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



(IJCSIS) International Journal of Computer Science and Information Security, 
Vol. 8, No. 8, November 2010 



Optimal Solution of 2-Dim Rectangle 
Packing Problem based on Particle Swarm 

Optimization 



Narinder Singh 

Department of Mathematics, 

Punjabi University, Patiala 

INDIA-47201 



S.B. Singh 
Department of Mathematics, Punjabi 
University, Patiala, INDIA,-147201 



J.C.Bansal 

AB V-Indian Institute of Information 

Technology and Management-Gwalior 

(M.P), INDIA 



Abstract-In the rectangle-packing problem, 
rectangular part are placed onto a 
rectangular stock sheet. Which is bigger in 
size in comparison to items with the aim of 
minimizing the unused space. This problem 
belongs to class of NP-complete problems 
where computation time for an exact solution 
increases with N (total items considered in the 
problem) and become rapidly prohibitive in 
cost as N increases. The solution approach to 
these problems lies in finding in optimal 
solutions while reducing the exhaustive search 
of all possible arrangements of nesting the 
parts and subsequently checking upon the 
execution time. Usually, various heuristic rule 
are proposed to generate different patterns 
that are near optimal. These heuristics are 
generally the priority rules used to allocate 
patterns to the stock sheet sequentially. 

In this paper, the optimal solution of 
rectangle-packing find by Particle Swarm 
Optimization. Nee (1991) has been used to 
generate 120 feasible patterns with different 
sheet utilization factors. The algorithm can be 

I. INTRODUCTION 

Background: There are so many small scale 
industries/business houses which are operating in 
our local housing colonies. They are the trader 
building doors, windows and other furniture 
items; tailors putting up all sewing patterns on 
cloth material; supplier who provide marble 
slabs for flooring in houses, where dimensions 
are rarely regular, or, a cobbler determined for 
cutting maximum number of shoes or gloves 
from a piece of leather. On the other hand, we 
have medium scale hosiery firms of Ludhiana 
(India), which are producing shirts, jeans and 
other articles, or, capital intensive industries, like 
transformer division of M/s Crampon Greaves 
Limited at Mumbai and Bhopal, where 50 tons 



used for different object sizes available in 
varying quantity each having many feasible 
patterns so as to meet the demand for items in 
a holistic manner. Also, the solution can be 
obtained by solving the LPP that takes in 
most viable solution generated by the revised 
heuristic, requirements of shapes and 
availability of sheets. 

In the present study a comparison has been 
made that makes use of LPP to look for 
solution Vs. gradually meeting the demand by 
taking optimal feasible solutions generated by 
the heuristic that gives the best sheet 
utilization and exhausting the demand at each 
step till the demand is fully met. 

Keywords: Cutting and Packing, Rectangle 
packing, Nesting, Assortment Problems, 
Heuristics, NP-complete problems, Scrap 
Management, Sheet Layout, Optimization 
Techniques, Particle Swarm Optimization. 



of sheets annually are used towards the 
fabrication of transformers. What's more, the 
resource sheet can be a motherboard that is few 
centimeters in dimensions and looks for so many 
chips to be laid on it or it can be a huge 
rectangular sheet from which brackets/floors are 
to be cut in pairs that are generally symmetric 
about a line for submarines. The share aims of all 
these stated examples is to conserve resource 
material as scrap left-over is not recycled or 
reused, but treated as wastage. Arranging shapes 
on resource material is the critical issue that 
results in ultimate utilization of the resource 
material while observing the manufacturing 
constraints specific to an industrial situation. So, 
they all strive hard by adopting various 
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techniques to conserve the depleting resources 
available to them in their respective fields and 
operate in the market for profit. This problem is 
classified as the problem of Nesting by the 
researchers. 

Particle Swarm Optimization: Particle swarm 
optimization (PSO) [1], [2] is a stochastic, 
population-based search method, modeled after 
the behavior of bird flocks. A PSO algorithm 
maintains a swarm of individuals (called 
particles), where each individual (particle) 
represents a candidate solution. Particles follow a 
very simple behavior: emulate the success of 
neighboring particles, and own successes 
achieved. The position of a particle is therefore 
influenced by the best particle in a neighborhood, 
as well as the best solution found by the particle. 

Particle position x t are adjusted using 

*,(f + l)=jc,(0 + v,(/ + l) ..(1) 
where the velocity component, v ; . (t) represents 
the step size. For the basic PSO. 

v^+\) = v y it)+c^ j {y y -x^+c^ j {y j -x l ^ ..(2) 
where w is the inertia weight [11], c andc 2 are 
the acceleration coefficients, r x .,r 2 . ~ £/(0,l) , y jf 
is the personal best position of particle i, and y . 
is the neighborhood best position of particle i. 

The neighborhood best position y i , of particle i 

depends on the neighborhood topology used [3], 

[4]. If a star topology is used, then jX refers to 

the best position found by the entire swarm. That 
is, 

y t ~{y (0^ 1 (0v...,x(0}=nir</u®U(yXt)),...J^X0) 

where S is the swarm size. The resulting 
algorithm is referred to as the global best (gbest) 
PSO. For the ring topology, the swarm is divided 
into overlapping neighborhoods of particles. In 

this case, jX is the best position found by the 

neighborhood of particle i. The resulting 
algorithm is referred to as the Local best (lbest) 
PSO. 

The Von Neumann topology defines 
neighborhoods by organizing particles in a lattice 
structure. A number of empirical studies have 
shown that the Von Neumann topology 
outperforms other neighborhood topologies [4], 
[5]. It is important to note that neighborhoods are 



determined using particle indices, and are not 
based on any spatial information. 

A large number of PSO variations have been 
developed, mainly to improve the accuracy of 
solutions, diversity, and convergence behavior 
[6], [7]. This section reviews those variations 
used in this study, from which concepts have 
been borrowed to develop a new, parameter-free 
PSO algorithm. 

Van den Bergh and Engelbrecht [8], [9], and 
Clerc and Kennedy [3], formally proved that 
each particle converges to a weighted average of 
its personal best and neighborhood best positions. 
That is, 



lim x„ 



c iy J i + c 2 y ll 



(3) 



This theoretically derived behavior provides 
support for the barebones PSO developed by 
Kennedy [10], where the velocity vector is 
replaced with a vector of random numbers 
sampled from a Gaussian distribution with the 

mean defined by equation (3), assuming that C x = 
C 2 , and deviation, 

a = \y. — v.. 
The velocity equation changes to 

v tj {t + \)~N{ yiJ+ 2 yiJ >°) 

The position update then changes to 

x.(t + 1) = v t (t + 1) 
Kennedy [9] also proposed an alternative version 
of the barebones PSO, where 



v„ (/ + !) = 



N( 



y l} 



CT) 



if U(0,1)^ 0.5 



otherwise 



(4) 



Based on equation (4), there is a 50% chance that 
the j-th dimension of the particle dimension 
changes to the corresponding personal best 
position. This version of the barebones PSO 
biases towards exploiting personal best positions. 

II. PROBLEM DEFINITION 

The basic logical structure of nesting problem 
investigated in this research paper presumes that 
the stock sheet/object and items to be nested are 
rectangular in nature, which categorizes it as 
Rectangle packing/Nesting. Accordingly, there is 
a stock of large geometrically defined objects 
stock sheets that are basically rectangles with 
different dimensions and an order list for small 
geometrically defined items (ordered items) also 
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rectangles of diverse lengths and widths. A 
layout pattern is defined as an arrangement 
indicating various shapes that can be laid on a 
stock sheet. So a solution to a nesting problem 
consists of a set of layout patterns together with 
instructions as to how often each layout pattern 
is to be used so as to meet the requirement. 

The problem is solved:- 

A. Using LPP approach in a holistic 
manner when each stock sheet is 
associated with different layouts each 
that guarantee different sheet utilization 
value and layout pattern as suggested by 
the rectangle packing heuristic [Sing 
and Jain, 2009]. The solution is 
obtained by solving LPP in a holistic 
manner by ceiling floating-point 
solutions to integer values. 

B. By exhausting requirement regularly by 
making use of rectangle packing 
heuristic by considering the layout 
pattern that result in best sheet 
utilization. 

Hypothesis: - The results produced by the two 
approaches do not differ substantially. 

III. MATHEMATICS FORMULATION OF 
LINEAR PROGRAMMING PROBLEM 

Let 

K Maximum number of feasible layout patterns 
considered for each available sheet. 

J Number of possible stock sheets available in 
the reserve. 1 < j <m 

I Number of possible objects in the order list 
\<i<n 

P.. k Possible pieces of the / space found on the 

j stock sheet in k layout pattern. 

X.. k Number of times k layout pattern of 

•th 

J stock sheet enter into optimal solution 
of the problem. 

'th 

q Requirement for each I shape as per the 

order list. \<i<n 
I. Maximum number of / stock sheet 

available in the reserve. \< j <m 
T. k Trim loss associated with each k layout 



pattern of j stock sheet 

A. Area of each / stock sheet available in 

j J 

the reserve. 

-th 

a i Area of each / object in the order list. 

TPC Total number of feasible pattern 
considered in LPP. 

Objective function is defined as: 

Maximize^ ^{UF^X^) \<k<TPC; \<j<i 
Minimize^ ^Jj.XX,,) \<k<TPC; \<j<m; 

Subject to the following constraints 

• Requirement for each object in the order 
list 

Z/Z^X^)^ Vi,l<k<IPQl<j<n<l<i<n 

• Reserve of stock sheets 
X,^>/ ; Vj,l<k<TPC; \<j<m; 

• Natural Constraint 

X..>0 Vy, 1 < k < TPC ; 1 < j < m ; 

• Feasibility Constraint 

X h A i - X ^i a i 1 - J - m > l - z - n > 

IV. BRIEF SYNOPSIS OF HEURISTIC 

One such heuristic study which is focused to 2- 
Dim rectangular packing problem where 
rectangular items are packed on to a larger 
containing region rectangular in shape, say 
object. The specific problem addressed is 
characterized by a set of rectangular items, 
which may contain identical items that can be 
rotated by 90 degree during packing process; 
packing arrangements are non-guillotine and 
orthogonal and the stock sheet is rectangular in 
shape, without any bad patches in it. Also, grain 
orientation of the item and object are trivial. The 
packing process has to ensure that there is no 
overlap of items while confining within the 
object. 

A. The items are first sequenced. The criteria 
for sequencing is based on the all-possible 
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aspects related to a rectangle e.g., length, 
width, area, perimeter, length/width ratio, 
priority assigned to items. This priority is 
expressed in terms of urgency for an ordered 
piece or profit associated with it. The lists 
are sorted in increasing order and in 
decreasing order. 

The item thus picked first is placed 
horizontally and then vertically at the lower 
left corner of the object, referred to as its 
reference point. Also, it is important to 
define here Sheet Utilization Ratio as the 
sum total area of ail items placed on the 
stock sheet to total stock sheet area. The 
item placed at reference point of the object 
gives rise to two pivot points. Pivot are the 
top-left and bottom-right corner of the 
ordered piece placed and the top-left and 
bottom-right corner of the ordered piece 
placed and the top-left and bottom-right 
corners of enclosing rectangle that encloses 
all ordered pieces placed so far. Essentially, 
pivot points are the only probable positions 
where next item in sequence can be placed. 
The next item is then placed at each of the 
pivot points, both length-wise and breadth- 
wise for all feasible results. Orientations that 
result in minimum wastage are retained . 



New pivot points are defined and used ones 
are deleted. In the algorithm, pivot points 
are sequenced in the following three ways:- 

• Minimum radial distance (PPD) 

• Minimum x-distance, in case of tie, 
point with minimum y-distance is to be 
served first (PPL) 

• Minimum y-distance, in case of tie, 
point with minimum y-distance is to be 
served first (PPB) 

Total number of feasible layouts thus possible 
are the product of possible orientations of the 
object (2) X possible orientations of the first item 
placed on the object (2) X possible sequencing 
patterns (increasing and decreasing) (2) X 
different basis for sequencing of items 
considered (5) X different sequencings of pivot 
points (along) considered (3)=2 X2X2X5X3 
= 120; Thus OL-IL-I-SL-PPD stands for pattern 
obtained when object is oriented length-wise; 
first-item placed on the reference point is also 
oriented length- wise; item are sorted in 
increasing order and are sequenced on the basis 
of length; pivot points are arranged in increasing 
order along the Diagonal of the object. 



V. EXPERIMENT SETUP 



Test data set, consisting of the objects of different sizes and the items required to the placed on these 
objects, were considered as suggested in Table I i.e., SET-I . 



SET-I 


STOCK SHEET : 2 
LENGTH = 70 WIDTH = 40 QUANTITY : 5 
LENGTH = 40 WIDTH = 40 QUANTITY : 4 


REQIREMENT 


SR.No 


1 


2 


3 


4 j 


5 


6 


7 


8 


9 


10 


LENGTH 


22 


31 


35 


24 


30 


13 


14 


14 


12 


13 


WIDTH 


21 


13 


9 


9 


7 


11 


10 


8 


8 


7 


QUANTITY 


7 


9 


4 


9 


12 


4 


11 


3 


12 


14 



VI. RESULTS 



The revised heuristic has been used on Set-I to 
generate 120 feasible patterns on each type of 
available sheet. In total 34 different layouts 



were observed (22 on sheet (70x40)). All these 
layouts have been tabulated in Table 2 and Table 

3. 
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Table 2 Different patterns generated by sheet type 1 ((70 x 40)) 



PIECES 



QUANTITY 



22X2 
1 



31X13 



35X9 



24X9 



30X7 



12 



13X11 



14X10 



11 



14X8 



12X8 



12 



13X7 



14 



Utilization 
Factor (UF %) 



Pattern 1 







12 



84.31 



Pattern 2 



12 



87.57 



Pattern 3 



11 



82.5 



Pattern 4 



12 



90 



Pattern 5 



12 



14 



86.64 



Pattern 6 



11 



13 



79.64 



Pattern 7 



12 



14 



90.64 



Pattern 8 



92.07 



Pattern 9 



83.18 



Pattern 10 



94.61 



Pattern 1 1 



79.93 



Pattern 12 



88.79 



Pattern 13 



92.79 



Pattern 14 



82.29 



Pattern 15 



85.79 



Pattern 16 



97.61 



Pattern 17 



94.46 



Pattern 18 



93.86 



Pattern 19 



12 



77.82 



Pattern 20 



85.54 



Pattern 21 



11 



14 



83.21 



Pattern 22 



12 



13 



83.39 



Table 3 Different patterns generated by sheet type 2 (40 x 40) 



PIECES 


22X21 


31X13 


35X9 


24X9 


30X7 


13X11 


14X10 


14X8 


12X8 


13X7 


Utilization 
Factor (UF %) 


QUANTITY 


7 


9 


4 


9 


12 


4 


11 


3 


12 


14 


Pattern 23 

















2 








12 





89.89 


Pattern 24 


























12 


3 


89.06 


Pattern 25 














6 

















78.75 


Pattern 26 





























14 


79.63 


Pattern 27 


























1 


14 


85.63 


Pattern 28 


























12 


2 


83.38 


Pattern 29 


1 














4 








2 


2 


88 


Pattern 30 


1 














4 


1 





1 


2 


90.75 


Pattern 31 








4 























78.75 


Pattern 32 


1 


1 











1 





2 


1 





83 


Pattern 33 


1 


1 











1 





2 





1 


82.69 


Pattern 34 





3 


1 























95.25 



In the corresponding LPP, let x x represents 

number of times patterns 1 is to be used , X 2 

represents number of times pattern 2 is to be 
used and so on. Accordingly the mathematical 
formulation for Set-I is as suggested below and 
obtained result is summarized in table 4. 



Objective Function 

Maximize z=84.31x 1 +87.57x2+90x3+86.64x4+90.64x5 +92.07^ +83. 18x7+94.61x^79.93x9 

+88.79x 10 +92.79x n +82.29x 12 +85.79x 13 ^ 

82.5x^+79.96x20+83.21x2! +83.39x22 +89.89x23+89.06x24+78.75x25+79.63x^+85.63x^+83.3 

+88x 9Q +90.75x, n +78.75x, 1 +83x, 9 +82.69x„+95.25x, 4 
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Subjective to Constraints (Requirement for 
each object in the order list) 



[Natural Constraints] 

x , > / = 1,2 34 

Optimal Solution of the Rectangle 
Packing Problem is: 

Xi =X = X^ = Xr = Xr=X e = X Q = X ln = X n =X n =X 



*1 A 2 A 4 A 5 A 6 



9 A 10 A 12 A 13 A 15 



12xi +Hxy +Hxa +12x^ +5x/- 4xo +12xi 7 +1 lx™ +1 lx™ +12x-p +12x ™ +12x^ 4x^y 



^16 



M7 x 18 x 19 x 20 x 21 x 22 



= 



A 23 A 24 A 25 A 26 A 27 A 28 A 29 A 31 A 33 A 34 
x 3 = x 6 = x 7 =x ll = x 14 = x 32 = 1 » x 30 = 8 



[Reserve of stock sheets] 

Table: 4 Description of result produced by LPP of SET-I 



Variable 


TPP 


UF (%) 


No. of Sheet 


Sheet Type 


Heuristic 


x 3 


12 


90 






OB-IL-I-SW-PPL 


X 6 


27 


90.64 






OL-IB-I-SA-PPL 


x 7 


10 


83.18 






OB-IL-I-SAR-PPL 


X ll 


9 


92.79 






OB-IB-D-SL-PPL 


X 14 


7 


93.86 






OB-IB-D-SL-PPL 


X 30 


9 


90.75 


8 


2 


OB-IB-I-SAR-PPL 


X 32 


6 


83 


1 


2 


OB-IB-D-SB-PPL 



Now the revised heuristic is applied in steps 
while selecting the best layout (A solution with 
maximum sheet utilization factor) and step by 
step exhausting the demand for different pieces. 



The solution obtained is tabulated (table 5) 
below. 



Table: 5 Description of result produced by Revised Rectangle Packing Heuristic of SET-I 



UF (%) 


TPP 


No. of Sheet 


Sheet Type 


Heuristic 


97.61 


07 






OL-IL-D-SP-PPL 


95.35 


09 






OB-IB-D-SP-PPL 


96.21 


10 






OL-IB-D-SA-PPL 


97.68 


15 






OL-IB-D-SL-PPL 


95.82 


14 






OL-IB-D-SB-PPL 


89.94 


09 




2 


OL-IL-D-SL-PPL 


89.44 


15 




2 


OL-IL-D-SAR-PPL 


67.13 


06 




2 


OL-IB-D-SL-PPL 
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VII. CONCLUSION 

On comparing the two results as 
summarized in Table 4 and Table 5 for Set-I, 
it has been observed that solution obtained 
by using revised heuristic in 



and step by step exhausting the demand in 
comparison to LPP meeting demand 
wholesomely makes use of smaller number 
of stock sheets and produces lesser number 
of surplus ordered pieces. 



Table: 6 Comparing between Linear Programming Packing Solution and Revised Rectangle 
Packing Heuristic Packing Solution for Set-I 



Revised Rectangle Heuristic 
Packing Solution 


Linear Programming 
Packing Solution 




085 


143 


Total pieces placed 


005 


005 


Sheets (70x40) used 


003 


009 


Sheets (40x40) used 


008 


014 


Total Sheets used in the approach 


000 


058 


Surplus pieces 
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Abstract — Generally, see Localization of nodes is a key technology 
for application of wireless sensor network. Having a GPS 
receiver on every sensor node is costly. In the past, several 
approaches, including range-based and range-free, have been 
proposed to calculate positions for randomly deployed sensor 
nodes. Most of them use some special nodes, called anchor nodes, 
which are assumed to know their own locations. Other sensors 
compute their locations based on the information provided by 
these anchor nodes. This paper uses a single mobile anchor node 
to move in the sensing field and broadcast its current position 
periodically. We provide an adaptive weighted centroid 
localization algorithm that uses coefficients, which are decided by 
the influence of mobile anchor node to unknown nodes, to 
prompt localization accuracy. We also suggest a criterion which 
is used to select mobile anchor node which involve in computing 
the position of nodes for improving localization accuracy. The 
localization accuracy of adaptive weighted centroid localization 
algorithm is better than maximum likelihood estimation which is 
used very often. 

Keywords-component; Weighted Centroid Algorithm; Wireless 
Sensor Networks; Localization; 



I. 



Introduction 



A wireless sensor network (WSN) consists of spatially 
distributed autonomous sensors to cooperatively monitor 
physical or environmental conditions, such as temperature, 
sound, vibration, pressure, motion or pollutants. The 
development of wireless sensor networks was motivated by 
military applications such as battlefield surveillance. They are 
now used in many industrial and civilian application areas, 
including industrial process monitoring and control, machine 
health monitoring, environment and habitat monitoring, 
healthcare applications, home automation, and traffic control. 

A sensor network normally constitutes a wireless ad-hoc 
network, meaning that each sensor supports a multi-hop routing 
algorithm (several nodes may forward data packets to the base 
station). In computer science and telecommunications, wireless 
sensor networks are an active research area with numerous 
workshops and conferences arranged each year. The 
applications for WSNs are varied, typically involving some 



kind of monitoring, tracking, or controlling. Specific 
applications include habitat monitoring, object tracking, 
nuclear reactor control, fire detection, and traffic monitoring. In 
a typical application, a WSN is scattered in a region where it is 
meant to collect data through its sensor nodes. 

A sensor network is composed of a large number of sensor 
nodes that are densely deployed in a field. Each sensor 
performs a sensing task for detecting specific events. The sink, 
which is a particular node, is responsible for collecting sensing 
data reported from all the sensors, and finally transmits the data 
to a task manager. If the sensors can't directly communicate 
with the sink, some intermediate sensors have to forward the 
data [1]. 

There are several essential issues (e.g., localization, 
deployment, and coverage) in wireless sensor networks. 
Localization is one of the most important subjects for wireless 
sensor networks since many applications such as environment 
monitoring, vehicle tracking and mapping depend on knowing 
the locations of the sensor nodes. In addition, with location- 
based routing protocols, both routing and data forwarding are 
determined based on the geographic location [2]. 

To solve the localization problem, it is natural to consider 
placing sensors manually or equipping each sensor with a GPS 
receiver. However, due to the large scale nature of sensor 
networks, those two methods become either inefficient or 
costly, so researchers propose to use a variety of localization 
approaches for sensor network localization. 

These approaches can be classified as range-based and 
range-free. Firstly, the range-based approach uses an absolute 
node-to-node distance or angle between neighboring sensors to 
estimate locations. Common techniques for distance or angle 
estimation include received signal strength indicator (RSSI), 
time of arrival (TOA), time difference of arrival (TDOA), and 
angle of arrival (AOA). The approaches typically have higher 
location accuracy but require additional hardware to measure 
distances or angles. Secondly, the range-free approach does not 
need the distance or angle information for localization, and 
depends only on connectivity of the network and the contents 
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of received messages. For example, Centroid method, APIT 
method, DV-HOP method, Convex hull, Bounding box, and 
Amorphous algorithm have been proposed [3] [4] [5]. Although 
the range-free approach cannot accomplish as high precision as 
the range-based [6], they provide an economic approach. Due 
to the inherent characteristics (low power and cost) of wireless 
sensor networks, the range-free mechanism could be a better 
choice to localize a sensor's position, so we pay more attention 
to range-free approach in this paper. 

This paper uses a single mobile anchor node as the 
reference node, which is required to move in the sensing field 
and broadcast its current position periodically. Sensor nodes 
receive the position information of the mobile node and 
localize themselves to the cenrroid of these positions by using 
adaptive weighted cenrroid algorithm. The algorithm based on 
the Received Signal Strength Indication (RSSI). The results of 
simulations show that the method is a practical method that can 
be used in real-world system, and is also a method whose 
principle is simple, less computing and communication, is low 
cost, and provides flexible accuracy. 

II. Related Work 

In the past several years, extensive research has been done 
on localization for wireless sensor networks. A general survey 
is found in. Here we provide only a brief survey about range- 
free approaches and localization method, which involve mobile 
reference nodes. Some nodes are equipped with special 
positioning devices that are aware of their locations. These 
nodes are called anchor nodes or reference nodes. Other nodes 
that do not initially know their locations are called unknown 
nodes or sensor nodes. Generally, an unknown node can 
estimate its location by range-based or range-free methods if 
three or more anchors are available in its coverage field. 
Obviously, the number and position of anchor nodes have a 
noticeable influence on the localization precision. 

The main idea of localization with a mobile anchor node is 
as follows: After sensor deployment, a mobile anchor node 
traverses the sensor network while broadcasting anchor 
packets, which contain the coordinates of the anchor node. 
Sensor nodes receiving anchor packets could infer their 
distance from a mobile anchor node and use these 
measurements as constraints to construct and maintain position 
estimates. These methods have a common feature: they use 
range-based approaches. Though they can reach fine resolution, 
either the required hardware is expensive (ultrasound devices 
for TDOA, antenna arrays for AOA) or the results depend on 
other unrealistic assumptions about signal propagation (for 
example, the actual received signal strengths of radio signals 
can vary when the surrounding environment changes). Due to 
the hardware limitations of sensor devices, range-free 
approaches are a cost effective alternative to a more expensive 
range-based approach. A simple algorithm proposed, computes 
location as the centroid of its proximate anchor nodes. An 
alternate solution, DV-Hop, extends the single hop broadcast to 
multiple-hop flooding, so that sensors can find their distance 
from the anchors in terms of hop counts. An amorphous 
positioning scheme adopts a similar strategy as DV-Hop; the 
major difference is that Amorphous improves location 
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estimates using offline hop-distance estimations through 
neighbor information exchange. Another existing range-free 
scheme is an APIT algorithm. APIT resolves the localization 
problem by isolating the environment into triangular regions 
between anchor nodes. A node uses the point-in-triangle test to 
determine its relative location with triangles formed by anchors 
and thus narrows down the area in which it probably resides. 
APIT defines the center of gravity of the intersection of all 
triangles that a node resides in as the estimated node location 
[7][8][9]. 



Based on these analyses, localization using a single mobile 
anchor node would be more economical. In addition, 
considering the constraints in computing and memory power of 
sensors, we adopted the weighted centroid method with a 
single mobile anchor to locate sensors in wireless sensor 
networks. Depending on the method used for ranging, an 
appropriate localization technique is applied in the second 
phase. The following localization strategies have been 
proposed. 

A Trilateration 

This is one of the more popular strategies and is used when 
the exact distances between known points and an object to be 
located are available. Fig. 1 shows when the distance between 
an object and three points are given, the object's location x can 
be computed as the intersection of three circles centered at the 
known points. 




Figure 1. Example of Trilateration 



B. Bounded Intersection 

The trilateration technique works well when the three 
circles intersect at a single point, but this is rarely the case 
when estimates are used in ranging. For example, when using 
incremental stepping of transmission power for ranging, 
maximum values can be used for estimating the distances. Fig. 
2 shows The object to be located would fall into a geometric 
region that is the intersection of three circles. 




Figure 2. Localization with Maximum Bounds 
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C. Triangulation 

The triangulation method is useful if the angle between two 
objects can be measured. Fig. 3 shows an example. Suppose PI 
and P2 are points with known locations and X is an object to be 
located. Nodes PI and P2 can measure angles al and a2 , and, 
with known distance Sx , one can easily compute ax , SI and 
S2. 




Figure 3. Example of Triangulation 



D. Maximum Likelihood 

When estimates are used for ranging, it is possible that the 
region of intersection is empty. This will occur if at least one 
ranging estimate is too small. One method that overcomes this 
problem selects the point for localization that gives the 
minimum total error between measured estimates and 
distances. In Fig. 4, distance estimates (dl, d2, d3) are made 
between the object to be located and three points (PI, P2, P3) . 
The errors (el, e2, e3) are computed by finding the difference 
between the actual Euclidean distances and the ranging 
estimates. 
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nodes. This can be done by driving the mobile anchor node to 
move in a spiral trajectory. Obviously there are many other 
options to moving trajectory. Finding an optimal trajectory to 
cover all sensor nodes can be a research topic for our future 
work. No matter which trajectory is used, the location of the 
mobile anchor node on the trajectory should be known. At the 
same time, we assume that the mobile anchor has sufficient 
energy for moving and broadcasting its information during the 
localization process. The speed of the mobile anchor is 
adjustable and unrestricted, but uniform in the process of 
location. 

We used an idealized radio model for wireless 
communication because it was simple and easy to reason 
mathematically. We assumed that our idealized model is 
perfect spherical radio propagation and has identical 
transmission range (power) for all radio positions as shown in 
Fig. 5. It is a sphere with the anchor as its center and the 
broadcasting radius R as its radius. Only the sensors within the 
range are assumed capable of receiving the information sent by 
the anchor. 




III. 



Figure 4. Localization with Maximum Likelihood 

Adaptive Weighted Centroid Localization 
Method 



A Method of Localization 

This method can be used in large-scale field environment. 
Figure 1 illustrates the system environment where a sensor 
network consists of a mobile anchor node and unknown nodes 
that could be scattered from a plane or from a mortar shell. The 
mobile anchor is a human operator or an unmanned vehicle 
deployed with the sensor network. If the network is deployed 
by plane scattering, this anchor can be even the plane itself. 
The unknown nodes are the nodes of initially unknown 
positions. Once the nodes are deployed, they will stay at their 
locations to conduct the sensing task. The mobile anchor, 
which is a node aware of its location (e.g. equipped with GPS), 
and is able to traverse for assisting the sensors to determine 
other node locations [10]. The mobile anchor node needs to 
traverse over the entire region in order to cover all sensor 



• Anchor node 
Mobile anchor trajectory / 

^•r* v • 


• 


-* * * * 

^. •• •."• 


Unknown node 


• m • Deployment arec 



Figure 5. System Environment with a Mobile Anchor Node 

In this paper, we proposed the location of mobile anchor 
node influence: In the localization algorithm, location of 
mobile anchor node has influence to the unknown nodes, RSSI 
bigger location, and the greater influence on the location of 
sensor nodes. When Unknown node received multiple mobile 
anchor node position signal then unknown node by the impact 
of these locations. Location of largest RSSI has the greatest 
power to decide to the position of sensor node. 

Signal selection principle: An unknown node may receive 
multiple signals of positions from the mobile anchor node. 

RSSI value should be the largest of several signals position 
calculation. Location computed to ensure that the signals 
involved in more than three. Will be distances of more than R 
the location of mobile anchor node removed, so as to avoid the 
expansion of the positioning error. Behind the simulation 
proves this point. 

B. Adaptive Weighted Centroid Localization Algorithm 

Through the front of the Analysis, can find common 
centroid algorithm, did not reflect the mobile anchor node's 
influence, affecting the localization accuracy. To enhance the 
localization accuracy, in this paper we used the adaptive 
weighted centroid localization algorithm. Its main idea: In the 
algorithm, mobile anchor node confronts the right to decide the 
location of the centroid through weighted factor to reflect. The 
use of weighted factor reflected the intrinsic relationship 
between them. 
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We embody this relationship through the formula of the 
weighted factor: s) Calculates the mean value(XJ) of C_n A 3 coordinate. 

The (X,Y) is Unknown node coordinate. 



X = 



,X1 X2 X3, 

( — + + ) 

A 1 1 . 

( — + — + — ) 
dl d2 d3 



,Y1 72 73, 

( — + — + — ) 
d3 Y= dl d2 d3 m 

' ,1 1 1, K) 

( — + — + — ) 
dl d2 d3 



Fig. 6 illustrates Known 3 mobile anchor nodes coordinate 
(XI, Yl), (X2, Y2), (X3, Y3), unknown node to anchor nodes 
distance dl, d2, d3. According to the formula can be calculated 
unknown node coordinates (X, Y). Compared to ordinary 
centroid algorithm, 1/ dl, 1/ d2, 1/ d3 is weighted factor. The 
factor 1/ dl, 1/ d2, 1/ d3 indicates that mobile anchor node with 
a shorter distance to unknown nodes has a larger infect its 
coordinates. We can improve the localization accuracy from 
these inner relations. 




Figure 6. Scheme of the adaptive weighted centroid localization algorithm 

Adaptive Weighted Centroid Localization Algorithm 
process : 

1) The mobile anchor node periodically sends its own 
information. 

2) Unknown node received information, only records the 
same location of the mobile anchor node average RSSI. 

3) Unknown node received over threshold m in the 
position information then RSSI value in accordance with the 
smallest sort of mobile anchor node location And to establish 
the mapping between RSSI value and the distance from 
unknown node to the mobile anchor node. The establishment 
of three sets: mobile anchor node_set-{al , a2 ,-, am}; 
Distance_set={dl ,d2 , - , d m}; Mobile anchor node 
position_set={(Xl , Yl ), (X2 , Y2), —, (Xm , Ym)}; 

4) RSSI value with the first few large location of mobile 
anchor node of the calculation: 

a) Based on the preceding analysis, In the mobile 
anchor node_set Select RSSI value of large node location then 
the composition of the triangle set. This is very important. 
Triangle_set={( al , a2, a3), (al, a2 , a4), •- (al, a3 , 
a4), (al, a3 ,a5) —}; 



IV. Experiments 

A. Simulation Environment 

The key metric for evaluating a localization technique is the 
accuracy of the location estimates versus the communication 
and deployment cost. To evaluate this proposed method we use 
UNIX, programs with the C language. We have carried on the 
computer simulation to the above algorithm. Simulation 
condition: The mobile anchor node reference MICA2 mote; 
Uses outdoor launches the radius 200 to 300m; Deployment 
area is 200*200. The unknown node arranges stochastically; 
the unknown node is 220. The mobile anchor node has 6 kinds 
of situations: 9, 12, 16, 20, 25, and 30 positions. 

B. Results and Analysis 

The simulation uses adaptive weighted centroid localization 
algorithm and maximum likelihood estimation method. 
Localization accuracy mainly depends on the numbers of the 
mobile anchor node broadcasting its positions or the anchor 
density. It is very easy for our method to change anchor density 
by adjusting the interval time or the moving length of the 
mobile anchor node broadcasting its positions or by changing 
the moving interval of spiral line. In comparison with other 
methods, this is one of the advantages with our method, and it 
does not require additional hardware. Figure 7 and Figure 8 
show the simulation result. In the figure 7 error of adaptive 
weighted centroid localization algorithm is 16.2m and error of 
maximum likelihood estimation is 24.2m when mobile anchor 
node is 9. In the figure 8 error of adaptive weighted centroid 
localization algorithm is 21.5m and error of maximum 
likelihood estimation is 40.5m when mobile anchor node is 30. 
As can be seen from the figure adaptive weighted centroid 
localization algorithm has better localization accuracy. Has the 
obvious superiority, if the anchor density is low. Adaptive 
Weighted Centroid Localization Algorithm is simple, and no 
communication is needed while locating. It does not require 
additional hardware. The mobile anchor node can be used 
many times. So it is very inexpensive. 



-Maximum Likelihood Estimation Method 
-Adaptive Weighted Centroid Localization Algorithm 




16 20 25 

Anchor node number 



Figure 7. Average Error 



5) n location of mobile anchor nodes can be composed of 
C_n A 3 triangles. The use formula (1) calculates C_n A 3 
coordinate. 
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— •— Maximum Likelihood Estimation Method 
—■—Adaptive Weighted CEntnoid Localization Algorithm 




12 1£ 2D 2B 30 

Anchor node number 



Figure 8. Maximum Error 



V. Conclusion and Future Works 

The Many wireless sensor network applications depend on 
nodes being able to accurately determine their locations. This is 
the first work to study range-free localization in the presence of 
mobility. One of our ideas is that a mobile anchor can improve 
the localization accuracy and coverage because it can move to 
every point of wireless sensor networks. Another factor is that 
range-free requires no extra hardware or data communication 
and reduces the costs of localization. Our simulation 
experiments reveal that our method can provide accurate 
localization even when memory limits are severe, the seed 
density is low, and network transmissions are highly irregular. 

Many issues remain to be explored in future work including 
how to select a moving path to improve the locating 
performance, how to apply this to real-world sensor networks 
and how to expend our method to other applications. 
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Abstract — The RNA molecule is substantiated to play 
important functions in living cells. The class of RNA with 
pseudoknots, has essential roles in designing remedies for 
many virus diseases in therapeutic domain. These various 
useful functions can be inferred from RNA secondary 
structure with pseudoknots. Many computational intensive 
efforts have been emerged with the aim of predicting the 
pseudoknotted RNA secondary structure. The computational 
approaches are much promising to predict the RNA structure. 
The reason behind this is that, the experimental methods for 
determining the RNA tertiary structure are difficult, time- 
consuming and tedious. In this paper, we introduce ABCRna, a 
novel method for predicting RNA secondary structure with 
pseudoknots. This method combines heuristic-based 
KnotSeeker with a thermodynamic programming model, 
UNAFold. ABCRna is a hybrid swarm-based intelligence 
method inspired by the secreting honey process in natural 
honey-bee colonies. The novel aspect of this method is adapting 
Case-Based Reasoning (CBR) and knowledge base, two 
prominent Artificial Intelligence techniques. They are 
employed particularly to enhance the quality performance of 
the proposed method. The CBR provides an intelligent 
decision, which results more accurate predicted RNA 
structure. This modified ABCRna method is tested using 
different kinds of RNA sequences to prove and compare its 
efficiency against other pseudoknotted RNA predicted methods 
in the literature. The proposed ABCRna algorithm performs 
faster with significant improvement in accuracy, even for long 
RNA sequences. 

Keywords- RNA secondary structure; pseudoknots; Case-Bases 
Reasoning; Artificial Bee Colony (ABC) algorithm. 



I. 



Introduction 



Ribonucleic acid or (RNA) is one of the nucleic acids, 
which plays diverse roles and functions. Basically, one kind 
of RNA is the messenger RNA (mRNA). It works as an 
intermediary in carrying the genetic information code from 
DNA to make proteins [1]. This carried genetic code is used 
in the natural process for synthesizing proteins in living cell. 
However, the recent biological studies confirmed that there 
are other kinds of RNAs, which play various useful roles [2]. 
The latest discovered functions of RNA molecule, include: 
splicing introns, catalyst for reaction and a regular in cellular 
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activities [3, 4]. Predicting the RNA structure is the key to 
determine and scrutinize the active functions of RNA 
molecule. This fact is emphasized by central dogma in 
biochemistry and biology research domain [5, 6]. The RNA 
secondary structural outputs provide the base for shaping the 
RNA three-dimension (3D) structure, which is the first step 
of the RNA tertiary structure phase. 

The importance of the computational methods for 
predicting RNA secondary structure has been acknowledged 
as a demanding research area, by computer scientists. Also, 
there are many conditions, facing the experimental methods 
that are used by biologists [7, 8]. The Nuclear Magnetic 
Resonance (NMR) and X-ray crystallography are the two 
popular experimental purification methods that are used to 
determine the RNA 3D spatial structure [9, 10]. Latest 
studies confirmed that many classes of RNA molecule 
broadly fold in the pseudoknot motif [11, 12]. Whereas, the 
RNA structural functions of pseudoknot elements, have been 
emphasized to be prominent for medical processes and 
designing anti-viral treatments, in therapeutic research [13]. 
Consequently, the computational RNA prediction methods 
for predicting the RNA secondary structures are extensively 
utilized with manageable efforts [14]. 

The RNA molecules come in two main shapes: the Stem- 
loop and the Pseudoknots, as illustrated in Figure 1 in terms 
of RNA structure classifiers [15]. The Stem-loop is a non- 
crossing RNA structure motif. While, the Pseudoknots is a 
crossing RNA structure, which plausibly has been spotted by 
[16]. Further, the pseudoknotted RNAs has been proven to 
play several vital roles. From complexity points of view, the 
top prediction methods of RNA without pseudoknots 
functional element are MFold [17] and Vienna [18] 
algorithms which execute with complexities 0(n 3 ) in time 
and 0(n 2 ) in space. PknotsRG [19] is one of the most proper 
algorithm for predicting RNA with pseudoknots. It requires 
0(n 4 ) and 0(n ) in time and space complexities, respectively. 
Even if the pseudoknotted RNA secondary structure 
prediction problem has been stated as Non-deterministic 
Polynomial time (NP)-Complete problem [20, 21], it is an 
insisted matter to be solved [22, 23], in recent years. 

In order to overcome the prediction problem of RNA 
secondary structure with pseudoknots, this article introduces 
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a nature-inspired hybrid method called "ABCRna". 
Innovatively, this approach combines a new derivation from 
Artificial Bee Colony (ABC) algorithm with a special 
deterministic constraints [24]. On top of this, it is borrowed 
from the Artificial Intelligence (AI) field, which is a kind of 
nature swarm-intelligence [25]. The objective of this 
proposed method is to build the entire RNA secondary 
structure with pseudoknots from a given single- stranded 
RNA primary sequence. Indeed, this proposed method is a 
combination of KnotSeeker (heuristic-based method [3]) 
with UNAFold (a dynamic programming method [26]) for 
solving the RNA structural related issue. This hybrid method 
is a new derivation from ABC algorithm. It adapts the 
inspired swarm-based intelligence behavior of the honey- 
bees in collecting nectar and converting that to honey and 
royal jelly [27]. Naturally, every individual worker bee visits 
many flower patches during the round-trip of collecting 
nectar and pollen. Then it goes back to the hive to submit the 
mixed nectar to the nurse bee. Finally, the nurse bee starts 
making honey by a natural biological secreting process. 

Intuitively, the proposed RNA structural hybrid method 
is deployed and built to solve the related pseudoknotted RNA 
bioinformatics problem. By a deeper understanding of the 
CBR technique [28], the proposed hybrid model obtains a 
global optima RNA structural assurance results with more 
accuracy and better performance. Finally, the results show 
that the ABCRna method significantly improves the 
execution time and the accuracy in both sensitivity and 
specificity. This improvement when comparing the outputs 
with the other pseudoknotted RNA prediction methods 
existing in the state-of-the-art like; FlexStem [29], HotKnots 
[30]andPknotsRG[19]. 

The remainder of this article is ordered as follows: In the 
next section, we start with describing the secondary structure 
of the RNA molecule, in computer context representation. In 
section 3 background materials, gives a concise expression to 
the generic ABC optimization method. Then, a derivation of 
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Figure 1. A stem-loop and pseudoknots of RNA structures types. 



ABC is adapted to generate the proposed method. Next, the 
CBR as a modern AI technique, is extensively and widely 
discussed, from theoretical concept. Section 4 presents the 
proposed method with the implemental mapping between 
pseudoknotted RNA secondary structural prediction and the 
secreting process of making honey. Subsequently, the 
following section reports the comparative benchmark of the 
proposed method. The results of ABCRna is comparing 
against the results of other RNA prediction methods in the 
literature. Finally, the article ends with conclusion remarks, 
in section 6. 



II. Secondary Structures of RNA 

A. RNA Stem-Loop (non-pseudoknots) 

The single- stranded RNA molecule forms many folded 
structures in hierarchal shape; the primary RNA single 
sequence, the secondary structure of RNA molecule, the 
three-dimensional (3D) or tertiary RNA functional structure 
and the quaternary structure for RNA polymerase [31]. 
Generally, the RNA computational methods predict the 
secondary structure of the given RNA primary sequence. 
Thus, the RNA secondary structure defines: as an RNA 
structural motif, which in some parts includes the double- 
stranded motifs. These parts joined by complementary and 
canonical base pairings with the other parts, which are the 
non-paired single bases. The double- stranded motif parts 
coming in several shaped of stem-loops: hairpin, internal (or 
interior), bulge, multi-branch external bases and stacking (or 
helices) loops. As explained above and illustrated in Figure 
2, the RNA primary sequence (RNA bases) folds and joins 
on itself in real RNA secondary structure by hydrogen 
chemical bonds for low energy and more stability [15]. In 
mathematical and computational representation concept, the 
various layers of RNA structures can be defined as follows: 

•b = bj, b 2 , ..., b b ..., b n , where b is an RNA primary 
sequence and b { is the RNA base or nucleotide [32, 33]. 
The element b t is also a member of set which includes 
{ 4 A','C','G','f/',W}. While, the first four alphabets are 
representation of the original paired bases (paired- 
nucleotides) of the real RNA molecule: Adenine, Cy to sine, 
Guanine and Uracil, respectively. The last nucleotide W 
is assigned to the non-paired base. Such that the n is the 
length of the given RNA sequence and 1 < i < n. 

• S ={(bi, bj)}, such that (b h bj) belongs to the canonical base 
pairs. S is the secondary structure of the given RNA 
primary sequence which satisfies the following conditions: 

- (b h bj) E {(A,U), (UA), (G,C), (C,G), (G,U) ,(U,G)}, 
these are the sets of RNA base-pairs. While, the base 
pairs include in the set { A-U , U-A , G-C , C-G} is a 
Watson-Crick RNA base-pairs [34], the set { A-U , U-A} 
is a Wobble RNA base-pair [35]. 

- Then S = {{b h bj): 1 < i < j < n and; - i > Const }, 
where Const is a threshold constant number depend on 
the limit length of the minimum un-paired bases in a 
stem-loop (hairpin, stem or bulge ... etc). The Const is 
typically taken to be equal three. 
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If (b h bj) E S, (b h b t ) E S and if b ( = b h then bj = b h This 
implies (b h bj) = (b k , bj). In another words, every base 
(nucleotide) in RNA secondary structure make join by 
hydrogen bond at most with another one base (non-triple 
or only allow one-to-one). 

If (b b bj) E S, (b h bi) E S and i < /e, this can include two 
location elements in RNA stem-loop structure (non- 
pseudoknots): 

■ If i < k < I < j , then the two base pairs are form a 
type of nested location elements (nested-fashion), as 
depictured in Figure 3 a. 

■ If i < j < k < I , then the two base pairs are form a 
type of juxtaposed location elements (juxtaposed- 
fashion) [36], as shown in Figure 3 b. 



B. RNA with Pseudoknots 

The majority of RNA molecule classes fold in functional 
structural elements called pseudoknots. Indeed, they belong 
to the (3D) tertiary structure element and perform an 
important useful roles and constructive functions [37]. 

The pseudoknots substructure can theoretically satisfy the 
following term. If there are two base pairs (b h bj) and (b h bj), 
then satisfy the conditions: i < k < j < I or k < i < I < j, 
as shown in Figure 3 c and d. These two base paired shapes 
are represented the pseudoknots RNA structural elements. In 
another word, the pseudoknots is a crossing sub-structural 
functional element in the RNA molecules. It forms 
interaction the unpaired bases part of the stem-loop, which 
folds back and join in a loop region located outside that 
stem-loop. 

In spite of the prediction algorithms of RNA with 
pseudoknots structural elements, have been proven to be NP- 
complete problem [21]. It is a demanding research area 
because of the pseudoknotted RNAs has importance as key 
functions. Further it plays essential roles in viral and cellular 
regulatory [381. 
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Figure 2. Different RNA element shapes, the image is adapted from [39]. 



Figure 3. The diagrammatic position relation between different types of 

RNA base pairs, (a) two base-pair in juxtaposed fashion, (b) two base-pair in 

nested fashion. (c)&(d) two base-pair in pseudoknots. 

Ill . B ACKGROUND MATERIALS 

A. Problem Statement of RNA with Pseudoknot 

Pseudoknotted RNA secondary structure is the problem 
of predicting its secondary structure from a given primary 
sequence. Particularly, it has recently become attractive 
research area. Due to that the RNA with pseudoknots, has 
many important and useful roles, which needs to be solved 
computationally [40]. The existing pseudoknotted RNA 
prediction algorithms perform in exponential time 
complexity. The best prediction method run, in the worst 
case, 0(n ) in time and 0{n) in space [19]. Thus they run 
very slowly and need an ever increasing memory-space, 
especially for long sequences. Veritably, this means that the 
prediction solving algorithms of the pseudoknotted RNA 
secondary structural problem, suffer from long execution 
time and storage complexities. To the best knowledge of the 
authors, the final structural results suffer from poor quality 
and inaccuracy, for long RNA sequences. 

The pseudoknots class of the RNA structural prediction 
issue, has been proven an NP-complete problem [20]. 
Increasingly, the collecting nectar to make honey is an 
inspired field for the bioinformatics researchers, which is 
derived from the original ABC model [24]. In this article, a 
new hybrid method as a sub-area of swarm intelligence 
approaches for solving the pseudoknotted RNA structural 
problem is adapted. Besides that the CBR as a modern AI 
technique highlighted a way to be deployed, in term of 
enhancement the final results of the proposed hybrid 
ABCRna model. From comparison points of view, we find 
this method improved the accuracy of the RNA structural 
outputs with good performance. 

B. Swarm-Intelligence in AI Technology 

Swarm Intelligence (SI): is an emergent and bioinspired 
field of AI, which has been generated from numerous 
researches in social insect's behavioural models [41]. The 
phrase swarm comes up to present solution to overcome the 
optimization problems. These optima solutions have been 
successfully got by utilizing the co-operative and 



291 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



(IJCSIS) International Journal of Computer Science and Information Security, 

Vol. 8, No. 8, 2010 



coordinative efforts among the worker-insects. The 
inspiration of the swarm intelligence is gained from many 
social insects behavioral models like; honey-bees colony and 
ant-colony. For instance in bee-colony, the objective of the 
swarm is the quantity and quality production of honey by the 
mutual teamwork. It is a key fact that, the amount of honey 
that an individual worker-bee harvests is worthless. But, the 
honey production by all worker-bees is considerably much 
better than the crop of an individual one [42]. 

Lately, swarm intelligence has obtained high interest to 
be adapted by many researchers from diverse fields. The list 
compromises, but it is not limited of: engineering, science 
and commerce fields. The computer researchers propose 
swarm intelligence optimization methods to solve many 
complex problems that suffer from severe drawbacks. The 
typical research domain of the computational swarm 
intelligence is to solve many real- world problems. Some 
applications of swarm intelligence in a development areas as 
follows: (i) The routing optimization in different 
communication network [43]. (ii) The job scheduling [44]. 
(iii) The swarm control in the Unmanned Aerial Vehicles 
(UAV) for both civil-military purposes [45, 46]. 

C. Honey-Bee Colony Structure 

Many social insects live in colonies have instinctual 
ability to perform as agents in a group for solving complex 
problems and to complete their tasks. The new AI 
disciplinary "swarm-intelligence" has been attractively 
produced by deep knowledge of the biological swarm in 
solving the problems. This can done by a behavioral 
interaction among thousands members of the swarm-insects 
[47]. Naturally, the social insects have talent to be in self- 
organized behavioral models for achieving an intelligence 
solution of the vital tasks. 

Honey-bees live in a well structured social insect's 
colony called a hive. The hive typically is a composition of a 
solo queen, drones and workers [48]. Each one does the 
following roles: (i) As usual, there is one queen. She is egg- 
laying, female as a mother for other colony members and 
mates one time in her lifelong by drones, (ii) There are 
drones or male bees as bee-colony fathers. Their main 
responsibility is fertilizing the new queen in a mating flight 
party (social gathering) before dying. They live at most six 
months and reach to hundreds up to several thousands during 
the summer season, (iii) There are around 10,000 in winter to 
60,000 in summer female worker-bees (foragers) in each 
bee-colony. They do many important jobs including: 
collecting nectar to make food, raising and bringing up the 
broods and larvae's, guarding and ventilating the hive. But, 
the primary resourceful task of the worker-bees is collecting 
the nectars and pollens from the flower patches (forage 
field). Later, when they back to the hive the worker bees 
secret the honey and royal jelly (food). 

D. Honey-bee Collecting Nectar (Foraging) 

Honey-bees collecting nectar process to make honey is to 
be considered as an optimization swarm-based intelligence 



approach [49]. The worker-bees perform the collecting 
nectar and secreting honey process in a well-organized 
behavioral model known as bees foraging process [50]. It is 
obvious that, this gigantic task is beyond the ability of every 
worker-bee individually. Nevertheless, all the group 
members interact among each other in a fashion to solve the 
collective bee-foraging problem. 

The main incentive task in bee's colony is the foraging 
(collecting nectar to make honey). To investigate the bee 
foraging process Seeley in [51], introduced a detailed 
systematic mechanism. It is about the self organized 
honeybee's social behavioral model in collecting forage, as 
shown in Figure 4. In the proposed system, every worker bee 
(forager) visits many flowers from the same type within 30 
to 120 minutes of foraging trip. All the collected nectars, 
from these flower patches, have been stored in the forager 
honey stomach. Besides that, the forager commits several 
actions to provide a feedback. Waggle dance is providing the 
profitability rating of nectar in the flower patches, the odor, 
location and other required information [52, 53]. 
Accordingly, the making honey and royal jelly process starts 
when the worker-bee back to hive from the foraging round- 
trip journey. 

Soon after reaching the hive from the foraging trip, the 
field bee (forager) gears up to submit that nectar, which 
already stored in her honey sac [54]. This process of 
submission the gathered nectar to the house bee (nurse bee) 
is accomplished in a regurgitated behavior. The role of the 
house bee is converting that nectar to honey or royal jelly 
(bee food) in a secreting process. In this synthesizing honey 
process, the main work is to split the complex sucrose sugar 
into fructose and glucose, which are simpler sugars and 
predominant in honey. This sucrose- splitting process is 
performed by adding the invertase, which is a special 
enzyme, to the nectar from the hypopharyngeal gland in the 
head of bee. Then, the new synthesized honey or royal jelly 
is spread out in a honey comb cells. The house bee exposes 
this secreted honey as a thin film to aware of the last 
filtration. This final step was done by increasing the surface 
area, to insure the fast evaporation of water in the well-done 
honey. Finally, the filled honey comb cells sealed and capped 
by propolis (plant gum), which is an adhesive material. This 
waxy cover prevents the honey from the bacterial attacks or 
in case of prevention the stored food to avoid the 
fermentation. 

Consequently, here the details of the foraging process are 
presented to make a base for our nature-inspired method. It is 
a hybrid adaptation from the process of honeybees in 
collecting nectar to make honey and royal jelly. The 
proposed ABCRna method solves the secondary structure 
prediction problem of RNA with pseudoknots. The idea is 
stimulating a hybrid novelty swarm-intelligence approach 
from collecting nectar and making honey in the natural 
secretion process. ABCRna as a new optimization algorithm 
is based on the main features of a hybrid between two 
heuristic-based method KnotSeeker [3] and dynamic 
programming algorithms UNAFold [26]. 
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Figure 4. A secreting honey process: the simulation of honeybees 
collecting nectar to secret honey and royal jelly mapping with a 
diagrammatic representation of the proposed method. 



E. CBR and KB 

It's commonly known that the AI research area provides 
many methodologies and technologies for solving complex 
problems, which the CBR is one of them. Recently, the CBR 
has been successfully used to restore solution for a new 
problem based on expertise by retrieving the similar mature 
solutions of the past problems [55]. Originally, CBR comes 
up from the cognitive science and the human expertise to 
retain and retrieve the information. In another word in CBR 
method, the people solve the new problem by recalling how 
they solved the past similar problems. The CBR method 
includes a problem solving cycle with four main activities: 
Retrieve, Reuse, Revise and Retain [56]. According to the 
Figure 5, in the heart of this four-RE's cycle there is a case- 
library as a Knowledge Base (KB). This KB is used in 
retrieval action to assess an intelligent decision of the similar 
cases for revising the final outputs by retrieving the most 
correct solutions. 

By referring to the adhere of exact matching concepts, 
the CBR is a generic AI methodology in problem solving 
[57]. In the proposed ABCRna method, the CBR is deployed 
as a modern AI inspired technique with KB to augment the 
result in retrieval steps. The role of CBR is finding the 
current pseudoknotted RNA sub-structure with the exact 
matching from KB. The KB holds and clusters all real 
pseudoknotted RNA sequences and their known native 
structures. If the retrieval one has pseudoknots in its 
secondary structure, then the CBR chooses the current one. 
This CBR comparing process, enhances the quality of the 
predicted pseudoknotted RNA secondary structure. 
Moreover, it is deployed significantly to be an alternative 
development technique for solving the secondary structure 
prediction problem of RNA with pseudoknots. 

F. Preliminarly in Optimization 

According to the theoretical viewpoint, the optimization 
methods are branch of the applied mathematics and basically 
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Figure 5. The Case-Based Reasoning (CBR), a modern Artificial 
Intelligence methodology, adapted from [55]. 



compromise from two main classes of algorithms; 
deterministic and probabilistic. Figure 6 shows the general 
category of the global optimization methods to clear the 
relation among all their characteristics. Definitely, the 
deterministic algorithms are a type of algorithm which take a 
set of fixed inputs and produce a fixed result. While, the 
heuristic is a single assumption works as a search strategy or 
technique in problem- solving. It is based on intelligence and 
experience, which can be applied loosely in computer 
implementation [58]. The meta-heuristic is based-on several 
assumptions work as an optimizer to improve a series of 
candidate solutions to reach to the final problem solving. 
Also it may use the many trials iteratively. In 2001, Geem et 
al. introduced the Harmony Search (HS) algorithm, which 
was a new meta-heuristic algorithm based on natural-inspired 
phenomena behavioral models [59]. The HS has been 
developed from mimicking the natural phenomena of the 
musicians improvisation (music players). Several 
experiments proved that the HS as a meta-heuristic 
algorithm, is capable to solve the optimization problems with 
more improved performance. The result makes the HS as a 
durable meta-heuristic algorithm in solving the NP-complete 
problems. The Traveling Salesman Problem (TSP) is an 
example of NP-problem which was solved by HS [60]. 

Now the main question, Is it feasible to develop a hybrid 
meta-heuristic algorithm for building the pseudoknotted 
RNA structure with good performance and more accurate 
result? To do this an optimized swarm-based intelligence 
algorithm would be inspired as a kind of stimulation from the 
Artificial Bee Colony (ABC) algorithm [61]. This inspired 
proposal utilizes the ABC to solve the related issue of RNA 
structure in bioinformatics. Moreover, the Particle Swarm 
Optimization (PSO) is a distinguished swarm-based 
intelligence algorithm that models some animal social 
behavior like fish schooling or swarm of honey-bees [62]. 
PSO has been proposed by Kennedy in 1995 and has reached 
to be an interesting area of knowledge to exploit for 
developing a new meta-heuristic algorithm by mimicking 
and inspiring the natural phenomena of animals and colony 
insects. 
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Figure 6. A schematic diagram of the global optimization methods. 

IV. Proposed Method 

This section explains in details, the new hybrid of derived 
ABC algorithm to overcome the pseudoknotted RNA 
secondary structure prediction problem. The proposed hybrid 
ABCRna method is inspired from the swarm-intelligence 
social behavioral model of honey-bees in collecting nectar 
and secreting honey, as shown in Figure 7. Hence, the 
authors develop ABCRna as a hybrid method in a simple 
way to build the secondary structure of RNA molecule with 
pseudoknots. The following sub-sections demonstrate 
separately the paradigms of designing the proposed method. 
These sub-sections describe the mapping of the all features 
between the ABC optimized algorithm and the RNA 
structural prediction problem. The final computational results 
of ABCRna for RNA structure reveal an optimized better 
performance and more accuracy in terms of sensitivity and 
specificity. Its computer code implementation shows less 
space and time complexities when comparing with other 
state-of-the-art methods in solving such RNA prediction 
problem. 

Here, the researchers underline the hybrid adaption 
model as a new derivation from ABC algorithm to solve 
RNA prediction problem. It is a first threshold further opens 
the door in front of the other bioinformatics researchers to 
follow. Furthermore, it gives immense opportunity to expand 
this proposed optimizer in solving such kind of complex bio- 
computing problems. This is why the AI material already has 
presented in the background section to be a general guidance. 



A. Honey-bee Foraging Algorithm 

The innovative ABC as a swarm-based intelligence 
algorithm was deployed particularly based on the honeybee 
natural social behaviours. A few other algorithms have been 
derived by inspiring the honeybees swarm behavioral model, 
intelligently [61]. Many researchers have been adapted such 
this swarm collective behaviours to solve optimization 
combinatorial problems. Herein, we describe a new hybrid 
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Figure 7. Workflow of the ABCRna approach for predicting the 
pseudoknotted RNA secondary structure, some parts adapted [55]. 



algorithm called "ABCRna", which is derived from the 
original ABC algorithm [24]. It is developed as a hybrid 
adaptation between ABC models with deterministic 
constraints and inspired by the intelligence social behaviours 
of bees in collecting nectar to secret honey. The proposed 
method is applied to solve the pseudoknotted RNA 
secondary structure prediction problem, which is a kind of 
combinatorial NP-complete problem [20]. 

The bees in colony deliberated for collecting nectar and 
secreting honey and they compromise in three bee groups: 
employed bees, unemployed bees (onlookers or scouts) and 
nurse bees, plus the food sources (flower patches 
profitability). The first two groups of honeybees (employed 
and unemployed) search for the last part which is the rich 
food sources. The third bee component takes the collected 
forage (nectar) from the first two groups by process of 
regurgitation. After that, the nurse bee starts making honey 
and royal jelly by a popular secretion honey process. The 
behavioral steps of the bees to carry out the forage collecting 
process, has been shown in Figure 4. Naturally, it can be 
described as follows: 

a) Employed bee (Forager): visits several food sources to 
collect the harvested crop, in each round-trip foraging 
journey. Nectar from many flower patches accumulate and 
store in the foragers' honey stomach (honey sac). 

b) Nurse bee: working inside the hive and she receive the 
collected nectar from the employed bee (forager) by 
regurgitation process. After that, the nurse bee starting makes 
honey or royal jelly from the associated mixed nectar by 
secreting invertase enzyme from the hypopharyngeal gland 
in her head. The corresponding enzyme assists to split the 



294 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



(IJCSIS) International Journal of Computer Science and Information Security, 

Vol. 8, No. 8, 2010 



complex sugar (sucrose) to two simplifier sugars (fructose 
and glucose), which are principal of new well-done honey. 



B. The Classical ABC Algorithm 

The ABC algorithm is a new AI model, which has been 
stimulated by the collective behavior of the social honeybees 
based on swam intelligence. It uses multi-resource and multi- 
form to perform the job with full optimization [24]. The 
ABC algorithm originally is divided into three parts: 
employed bees, onlookers and scouts. The employed bee is a 
hard-worker part in the colony that responsible to collect 
food. Onlookers part is waiting inside the hive to decide on a 
forage source. Scouts is performed a general search to find 
the food resources. 



C. Hybrid ABC Algorithm for RNA Structural Prediction 

Our proposed ABCRna method is a hybrid model based 
on the PSO and it is derived from the original ABC model. 
This new derivation of the modified ABC is associated with 
a specific case corresponding to the pseudoknotted RNA 
secondary structure prediction problem. The worker bee 
(employed bee) works as an agent, visits many rich flower 
patches (artificial food sources) to collect the nectar. 
Thereafter, all collected nectar from many flowers stored in 
foragers' honey stomach, which will be a mixture of nectar 
from several food sources (many flowers). Then, the 
employed bee (forager) back to the hive from the foraging 
journey with the mixed nectar fills her honey stomach. In 
the hive, the forager submits the crop (collected nectar) to 
the nurse bee in a regurgitation process. Finally, the nurse 
bee now is ready to make honey from the corresponding 
mixture of gathered nectar that submitted by employed bee. 
The nurse bee starts secreting the honey or royal jelly 
associated with specific needs of the hive. Here, the final 
well-done honey is represented the good solutions for the 
RNA structural prediction problem. In another words, the 
concluding honey in mapping segment, stands for the more 
accurate pseudoknotted RNA secondary structure for a 
giving primary sequence. 

The central phase of the ABCRna method is a HoneyRna 
algorithm, which is a modified from ABC algorithm to solve 
the pseudoknotted RNA structural prediction method. This 
HoneyRna algorithm is illustrated in Figure 7 and computes 
in steps as follows: 

1: Initialize 
2: REPEAT 

3: Place the employed bee on her food sources (many flowers) 
4: Place the nurse bee on hive working to receive mixed nectar 
5: Secret enzyme to split the complex nectar to a simpler honey 
6: Fill the secreting food (honey & royal jelly) in the honeycomb 
7: Filter the well-done honey from extra water by evaporation 
8: Cap and seal the filled cell with food by adhesive wax 
9: UNTIL (Demanded food is met) 

In the modified ABC algorithm, each cycle of the 
collecting nectar and secreting honey process includes three 



steps: (i) the employed bee visits many flower patches in 
each round-trip of collecting nectar journey. All gathered 
nectar is stored in her honey stomach. The employed bee 
backs to the hive with holding the mixed nectar. Then, she 
will submit this mixture to the nurse bee. 

Moreover, the secretion process of the honey by nurse 
bee performs in many steps, as follows: 

a) The harvest of the forage (the mixed nectar), has been 
collected from many flowers. By mapping this phase with 
the RNA related issue, the predicted RNA secondary 
structure is collected from many existed RNA predicted 
methods, as illustrated in Figure 7. 

b) The nurse bee starts make honey by secreting invertase 
enzyme from the gland in her head. This enzyme simplifies 
the sucrose which is a complex sugar in the nectar to two 
types (fructose and glucose) of simpler sugars, which are 
composed the well done honey. By mapping this with RNA 
structural problem, there is an agent program, which is 
working like that enzyme. This function re-constructs the 
entire secondary structure of RNA sequence with 
pseudoknots from many parts. 



V. Benchmark Tests and Results 

We evaluated ABCRna on different types of 
pseudoknotted RNA classes. The proposed method is built to 
predict the RNA secondary structure with pseudoknots. The 
comparisons of the ABCRna results have been performed by 
measuring the accuracy of its outputs to the outputs that has 
been achieved from FlexStem [29], HotKnots [30] and 
pknotsRG [19]. These accuracy measurements compromise 
three statistical notations: (Sensitivity S, Selectivity P and F- 
measure). They can be calculated by applying the following 
formulas, which derived from [63]: 



TP 



Sensitivity S = 
Specificity P = 



F - measure = 2 x P x ( J x 100, 

7> + S 



x 100 
TP + FN 


(1) 


TP 




x 100 
TP + FP 


(2) 


S 





(3) 



where TP is represented the True Positive, which denotes the 
number of base pairs that are predicted correctly and 
presented in the known native structure. FN is represented 
the False Negative, which counts the base pairs that are 
presented in the known native structure, but they are not 
reported in the predicted structure. FP is represented the 
False positive, which denotes the number of base pairs, 
presented in the native known structure, but they are not in 
predicted structure. F-measure is a single measure that 
combines both sensitivity and specificity of the predictor 
algorithm in a unique performance measure. 
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(a) Native structure of TMV of length 412 nt. 
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(c) Structure of TMV predicted by FlexStem, 2008 [29]. 



■ M'H-u--. 



* j-c-*-c-c-u-£j.r 

<, M-A G U G 



C 
I 

9 u 

M 

c 9? 

u 9 - 9 

A , 99 




"U^3 S 



G c 



A^ c 



I 







(b) Structure of TMV predicted by our ABCRna method. 
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(d) Structure of TMV predicted by HotKnots, 2005 [30]. ^ 
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(e) Structure of TMV predicted by PknotsRG,2004 [19]. 



Figure 8. Plots of qualitative comparison analysis of TMV structures: (a) The known native secondary structure of TMV molecule, (b) Secondary structure 
predicted by our proposed ABCRna method, with highest excellent sensitivity of (92.9%) and specificity (95.6%). (c) Secondary structure predicted by FlexStem 
(sensitivity of 44.3% and specificity 44.9%). (d) Secondary structure predicted by HotKnots (sensitivity of 67.1% and specificity 81.0%). (e) Secondary structure 

predicted by pknotsRG (sensitivity of 60.0% and specificity 66.7%). 
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Here, the comparison analysis of the outputs are 
performed between our proposed ABCRna method against to 
the FlexStem [29], HotKnots [30] and pknotsRG [19]. One 
example of this comparison pricess uses the RNA sequence 
tobacco mosaic virus (TMV) from 3'UTR type [64]. The 
length of TMV is equal 214 nucleotides (nt), which its 
accession number "J02415". Our proposed ABCRna method 
obtained the highest results, Sensitivity (S = 92.9%) and 
Specificity (P = 95.6%), which are measured according to 
the known native structure of TMV molecule. The sensitivity 
and specificity of FlexStem [29], HotKnots [30] and 
pknotsRG [19], are listed in the legend of the illustrative 
Figure 8, respectively. Finally, the Figure 8 depicts a 
qualitative comparison analysis among the output of our 
ABCRna and the best result of all others methods from the 
literature. This comparison analysis is applied on the 
secondary structure of TMV RNA molecule, which its 
images are produced by Pseudo Viewer software tool [65]. 
NUPACK [66] and pknotsRE [67] methods cannot predict 
the secondary structure of RNA sequences in larger than the 
length of 200 nt and 150 nt, respectively. The reason behind 
that the both algorithms NUPACK [66] and pknotsRE [67], 



require an enormous amount of memory (RAM) and run in 
exponential time. To reach to a fair comparison, the outputs 
for these scenarios put out of the result. Also, all five existing 
methods (FlexStem [29], HotKnots [30], pknotsRG [19], 
NUPACK [66] and pknotsRE [67]), have been implemented 
in the same machine, a PC Ubuntu 10.04 64-bit Linux OS, 
with AMD Phenom-II 810 2.6-GHz Quad-Core processor 
and Dual Channel 4GB (2x2GB) DDR2-800 Memory 
(RAM). 

Table 1 summarizes the final comparison analysis of the 
results among the predicted RNA structures from our 
proposed ABCRna method and the best ones from FlexStem 
[29], HotKnots [30], pknotsRG [19], NUPACK [66] and 
pknotsRE [67] methods. The comparison process has been 
done in respects to the three accuracy metrics listed in 
Equations (1,2 and 3). The evaluation of these comparative 
results were performed and verified according to the standard 
native structures of each RNA molecule. The analyses show 
the results of the ABCRna method are significantly better 
than the results of other methods from literature, in terms of 
sensitivity, specificity and F-measure. 



TABLE I. Comparison Accuracy Metrics The Structural Results of Base Pairs among the ABCRna method and other methods. 



RNA sequence 


ABCRna (2010) 


FlexStem (2008) 


HotKnots (2005) 


pknotsRG (2004) 


NUPACK (2003) 


pknotsRE (1999) 


Seq. ID, Ref. 


RNA 

class 


Accession 
Number 


Length 
(nt) 


S 


P 


F 


S 


P 


F 


S 


P 


F 


S 


P 


F 


S 


P 


F 


S 


P 


F 


HDV-It_g [68] 


Ribozymes 


X04451 


87 


90.6 


93.5 


92.1 


84.4 


79.4 


81.8 


71.9 


82.1 


76.7 


90.6 


93.5 


92.1 


62.5 


62.5 


62.5 


81.3 


81.3 


81.3 


BMV [69] 


3'-UTR 


J02042 


145 


73.8 


62.0 


67.4 


45.2 


38.8 


41.8 


31.0 


31.7 


31.3 


47.6 


43.5 


45.5 


45.2 


44.2 


44.7 


45.2 


44.2 


44.7 


FMDV-A [70] 


5'-UTR 


AY593751 


165 


83.3 


54.1 


65.6 


66.7 


41.0 


50.8 


12.5 


07.3 


09.2 


66.7 


41.0 


50.8 


62.5 


34.9 


44.8 


* 


* 


* 


TMV [71] 


3'-UTR 


J02415 


214 


92.9 


95.6 


94.2 


44.3 


44.9 


44.6 


67.1 


81.0 


73.4 


60.0 


66.7 


63.2 


* 


* 


* 


* 


* 


* 



The highest results are in bold, and "*" denotes that the algorithm is unable to complete the run. 



IV. Conclusion and Future Work 

This paper presented a novel hybrid method for solving 
RNA secondary structure with pseudoknot functional 
classes. This hybrid method includes ABC model as a global 
optimization method, hybridized with CBR as a local 
optimization technique. The proposed method used the 
existing results from KnotSeeker and UNAFold to generate a 
secondary structure of RNA includes pseudoknots, by using 
an existing cases. 

Three evaluation mechanisms are used to measure the 
efficiency and performance of proposed method comparing 
to others from literature. The sensitivity, specificity and F- 
measure metrics showed that successful outcomes have been 
recorded. Furthermore, three different comparative methods 
are used in order to compare the obtained results. The 
proposed ABCRna hybrid method outperformed other 
comparators in almost all standard benchmarks. Note that the 
factor for comparison is the real native structure. 



We believe that the proposed hybrid method have a high 
potential with a great efficiency for the problems solved by 
RNA community. This worthwhile domain is pregnant with 
several future research directions such as: further study cases 
in CBR, different global optimization, different factors of 
analysis and hybridize more RNA prominent methods. 
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Abstract— Image compression is an inevitable solution for 
image transmission since the channel bandwidth is limited and 
the demand is for faster transmission. Storage limitation is also 
forcing to go for image compression as the color resolution and 
spatial resolutions are increasing according to quality 
requirements. JPEG compression is a widely used compression 
technique. JPEG compression method uses linear quantization 
and threshold values to maintain certain quality in an entire 
image. The proposed method estimates the vitality of the block 
of the image and adapts variable quantization and threshold 
values. This ensures that the vital area of the image is highly 
preserved than the other areas of the image. This hybrid 
approach increases the compression ratio and produces a 
desired high quality output image. 



Key words- Image Compression, Edge-Detection, 
Segmentation. Image Transformation, JPEG, 
Quantization. 

I. INTRODUCTION 

Every day, an enormous amount of 
information is stored, processed, and transmitted digitally. 
Companies provide business associates, investors, and 
potential customers with financial data, annual reports, 
inventory, and product information over the Internet. And 
much of the online information is graphical or pictorial in 
nature; the storage and communications requirements are 
immense. Methods of compressing the data prior to storage 
and/or transmission are of significant practical and 
commercial interest. 

Compression techniques fall under two broad 
categories such as lossless[l] and lossy[2][3]. The former 
is particularly useful in image archiving and allows the 
image to be compressed and decompressed without losing 
any information. And the later, provides higher levels of 
data reduction but result in a less than perfect reproduction 
of the original image. Lossy compression is useful in 
applications such as broadcast television, 
videoconferencing, and facsimile transmission, in which 
certain amount of error is an acceptable trade-off for 
increased compression performance. The foremost aim of 
image compression is to reduce the number of bits needed 
to represent an image. In lossless image compression 
algorithms, the reconstructed image is identical to the 
original image. Lossless algorithms, however, are limited 
by the low compression ratios they can achieve. Lossy 
compression algorithms, on the other hand, are capable of 
achieving high compression ratios. Though the 
reconstructed image is not identical to the original image, 



lossy compression algorithms obtain high compression 
ratios by exploiting human visual properties. 

Vector quantization [4], [5] , wavelet transformation 
[1] , [5]-[10] techniques are widely used in addition to 
various other methods[ll]-[17] in image compression. The 
problem in lossless compression is that, the compression 
ratio is very less; where as in the lossy compression the 
compression ratio is very high but may loose vital 
information of the image. Some of the works carried out in 
hybrid image compression [18]-[19] incorporated different 
compression schemes like PVQ and DCTVQ in a single 
image compression. But the proposed method uses lossy 
compression method with different quality levels based on 
the context to compress a single image by avoiding the 
difficulties of using side information for image 
decompression in [20]. 

The proposed method performs a hybrid 
compression, which makes a balance on compression ratio 
and image quality by compressing the vital parts of the 
image with high quality. In this approach the main subject 
in the image is very important than the background image. 
Considering the importance of image components, and the 
effect of smoothness in image compression, this method 
segments the image as main subject and background, then 
the background of the image is subjected to low quality 
lossy compression and the main subject is compressed with 
high quality lossy compression. There are enormous 
amount of work on image compression is carried out both 
in lossless [1] [14] [17] and lossy [4] [15] compression. 
Very few works are carried out for Hybrid Image 
compression [18]-[20]. 

In the proposed work, for image compression, the 
edge detection, segmentation, smoothing and dilation 
techniques are used. For edge detection, segmentation 
[21], [22] smoothing and dilation, there are lots of work has 
been carried out [2], [3]. A novel and a time efficient 
method to detect edges and segmentation used in the 
proposed work are described in section II, section III gives 
a detailed description of the proposed method, the results 
and discussion are given in section IV and the concluding 
remarks are given in section V. 

II. BACKGROUND 
A. JPEG Compression 

Components of Image Compression System (JPEG). 

Image compression system consists of three closely 
connected components namely 

• Source encoder (DCT based) 
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• Quantizer 

• Entropy encoder 

Figure 2 shows the architecture of the JPEG encoder. 
Principles behind JPEG Compression. A common 

characteristic of most images is that the neighboring pixels 
are correlated and therefore contain redundant information. 
The foremost task then is to find less correlated 
representation of the image. Two fundamental components 
of compression are redundancy and irrelevancy reduction. 
Redundancy reduction aims at removing duplication from 
the signal source. Irrelevancy reduction omits parts of the 
signal that will not be noticed by the signal receiver, namely 
the Human Visual System (HVS). The JPEG compression 
standard (DCT based) employs the use of the discrete cosine 
transform, which is applied to each 8x8 block of the 
partitioned image. Compression is then achieved by 
performing quantization of each of those 8x8 coefficient 
blocks. 

Image Transform Coding For JPEG Compression 
Algorithm. 

In the image compression algorithm, the input image is 
divided into 8-by-8 or 16-by-16 non-overlapping blocks, 
and the two-dimensional DCT is computed for each block. 
The DCT coefficients are then quantized, coded, and 
transmitted. The JPEG receiver (or JPEG file reader) 
decodes the quantized DCT coefficients, computes the 
inverse two-dimensional DCT of each block, and then puts 
the blocks back together into a single image. For typical 
images, many of the DCT coefficients have values close to 
zero; these coefficients can be discarded without seriously 
affecting the quality of the reconstructed image. A two 

dimensional DCT of an M by N matrix A is defined as 
follows 

M4N4 ;r(2m + l)p ;r(2n + l)q 

Bpq = (XpOlq^ 2j ^ mn C0S COS ^ T , < p < M-l 



2M 



2N 



< q < N-l 



where 



a P = 



a q 



= \ i/Vm , p = o 

/2/M ,l<p<M-l 

= \ 1/Vn ,q = 

2/N , l<q<N-l 



The DCT is an invertible transformation and its inverse is 
given by 



SS n ;r(2m + l)p ;r(2n + l)q 

Amn = a P (Xq > > Bpq COS — ^OS — — , < p < M-l 

£i^o 2M 2N 

< q < N-l 



Where 



a P = 



a q = 



i/Vm , p = o 

v/2/M , l<p<M-l 

1/VN ,q = 
v/2/N , l<q<N-l 



The DCT based encoder can be thought of as essentially 
compression of a stream of 8 X 8 blocks of image samples. 
Each 8X8 block makes its way through each processing 
step, and yields output in compressed form into the data 
stream. Because adjacent image pixels are highly correlated, 
the 'forward' DCT (FDCT) processing step lays the 
foundation for achieving data compression by concentrating 
most of the signal in the lower spatial frequencies. For a 
typical 8X8 sample block from a typical source image, 
most of the spatial frequencies have zero or near-zero 
amplitude and need not be encoded. In principle, the DCT 
introduces no loss to the source image samples; it merely 
transforms them to a domain in which they can be more 
efficiently encoded. 

After output from the FDCT, each of the 64 DCT 
coefficients is uniformly quantized in conjunction with a 
carefully designed 64 - element Quantization Table. At the 
decoder, the quantized values are multiplied by the 
corresponding QT elements to recover the original 
unquantized values. After quantization, all of the quantized 
coefficients are ordered into the "zig-zag" sequence as 
shown in figure 1 . This ordering helps to facilitate entropy 
encoding by placing low- frequency non-zero coefficients 
before high-frequency coefficients. The DC coefficient, 
which contains a significant fraction of the total image 
energy, is differentially encoded. 



AC 



DCC 




AC 



AC 7 



Figure 1 Zig-Zag Sequence 

The JPEG decoder architecture is shown if figure 3 
which is the reverse procedure described for compression. 

B. Segmentation 

Let D be a matrix of order m x n, represents the 
image of width m and height n. The domain for Di,j is 
[0..255] , for any i=l..m, and any j=l.. n. 

The architecture of segmentation using histogram 
is shown in figure 4. To make the process faster the high 
resolution input image is down sampled 2 times. When the 
image is down sampled each time the dimension is reduced 
by half of the original dimension. So the final down sampled 

m n_ 

image (D) is of the dimension 4 x 4 . The down sampled 
image is smoothed to get smoothed gray scale image using 
equation (1). 

S 1 , J =l/9(D,g. 1 +D,g+D, g+1 +D 1;J . 1 +D 1 , J +D l5j+1 +D 1+U -i 
+Di +u +Di +u+ i) ...(1) 
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Figure -4 Classifier 

The histogram(H) is computed for the gray scale image(S). 
The most frequently present gray scale value (Mh) is 
determined from the histogram by equation (2) and is shown 
as indicated by a line in figure 5. 



Mh = arg{max(H(x))} 



(2) 



The background value of the images is having the highest 
frequency in the case of homogenous background. In order 
to surmise background textures a range of gray level 
values are considered for segmentation. The range is 
computed using the equations (3) and (4). 



L = max(Mh-30,0) 
U = min( Mh + 30,255) 



...(3) 
...(4) 



The gray scale image S is segmented to detect the 
background area of the image using the function given in 
equation (5) 




Figure - 5 



After processing the pixel values for background area is 1 in 
the binary image B. To avoid the problem of over 
segmentation the binary image is subjected to sequence of 
morphological operations. The binary image is eroded with 
smaller circular structural element (SE) to remove smaller 
segments as given in equation (6). 



B = BQSE 



..(6) 
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Then the resultant image is subjected to morphological 
closing operation with larger circular structural element as 
given in equation (7). 



B = B^SE 



...(7) 



III. PROPOSED HYBRID JPEG COMPRESSION. 

The input image is initially segmented into 
background and foreground parts as described in section 
II.B Then the image is divided into 8x8 blocks and DCT 
values are computed for each block. The quantization is 
performed according to the predefined quantization table. 
The quantized values are then reordered based on zig-zag 
ordering method described in section II A. The lower values 
of AC coefficients are discarded from the zig-zag ordered 
list by comparing the threshold value selected by the 
selector as per the block's presences identified by the 
classifier. If the block is present in foreground area then the 
threshold is set to a higher value by the selector, otherwise a 



10. Quantize the DCT coefficients 

11. Discard lower quantized values based on the 
threshold value selected by the selector. 

12. Compress remaining DCT coefficients by Entropy 
Encoder 

The architecture of the proposed method is shown in figure 
6. The Quantization Table is a fixed classical table derived 
from empirical results. The Quantizer quantizes the DCT 
coefficients computed by FDCT. The classifier identifies the 
class of each pixel by segmenting the given input image. 
The selector and limiter works together to find the discard 
threshold limit. The entropy encoder creates compressed 
code using the Code Table. The compressed image may be 
stored or transmitted faster than the existing method. 

IV. RESULTS AND DISCUSSION 

The Hybrid JPEG Compression method is implemented 
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Figure 6. Hybrid JPEG compression method 



lower value for threshold is set by the selector. After 
discarding insignificant coefficients the remaining data are 
compressed by the standard entropy encoder based on the 
code table. 

Algorithm 

1 . Input High Resolution Color image. 

2. Down sample the input image 2 times. 

3. Convert the down sampled image to gray scale 
image (G). 

4. Find histogram (H) of the gray scale image. 

5. Find the lower (L) and upper (U) gray scale value 
of background area. 

6. Find Binary segmented image (B) from the gray 
scale image (G) 

7. Up sample Binary image (B) two times. 

8. Divide the input image into 8x8 blocks 

9. Find DCT coefficients for each blocks 



according to the description in section III and tested with a 
set of test images shown in figure 8. The results obtained 
from the implementation of the proposed algorithms are 
shown in figures 7, 9,10 and table I. Figure 7.a shows the 
original input image. In Figure 7.b the segmented object 
and background area is discriminated by black and white. 
The compressed bit rates of the twelve test images are 
computed and tabulated in table 1 . The low quality (LQ) and 
high quality (HQ) JPEG compression is performed and the 
corresponding compression ratios(CR) and PSNR values 
are tabulated. The PSNR is higher for HQ and CR is higher 
for LQ. The Hybrid JPEG compression performs HQ 
compression on main subject area and LQ compression on 
background area thus the PSNR value at main subject area is 
the same for Hybrid JPEG and HQ JPEG. Figure 9 shows 
the comparison of normalized CRs of Hybrid JPEG and HQ 
JPEG, it is observed that almost all of the images are 
compressed better than classical JPEG compression. Figure 
10 shows how well the compression ratio is increased than 
the classical JPEG compression method. 
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a)Input Image 




b)Segmented Main Subject Area 
Figure- 7 Input /Output 

Table -1 Compression Ratio and PSNR values obtained by Hybrid JPEG and JPEG 



Image 


LQ 


Hybrid 


Hybrid/HQ 


HQ 


CR2-CR3 


CR1 


PSNR1 


CR2 


PSNR2 


PSNR@ 

MainSubject 


CR3 


PSNR3 


3 


26.00 


21.52 


7.46 


27.82 


27.84 


7.46 


27.82 


0.0000 


5 


25.29 


22.09 


6.80 


28.87 


28.93 


6.80 


28.87 


0.0004 


10 


23.83 


20.78 


5.41 


28.09 


28.13 


5.41 


28.09 


0.0031 


4 


24.33 


22.08 


6.61 


31.13 


31.20 


6.60 


31.13 


0.0068 


11 


27.75 


25.33 


8.62 


35.97 


36.25 


8.61 


36.01 


0.0198 


9 


24.92 


21.62 


6.58 


30.45 


30.56 


6.56 


30.51 


0.0204 


1 


27.54 


23.03 


8.40 


29.37 


29.40 


8.37 


29.37 


0.0276 


7 


24.85 


17.71 


4.01 


23.38 


23.43 


3.92 


23.38 


0.0911 


8 


24.63 


21.97 


6.73 


31.05 


31.13 


6.64 


31.10 


0.0921 


12 


29.18 


22.73 


5.92 


28.70 


29.07 


5.61 


28.93 


0.3111 


6 


26.47 


20.12 


5.54 


25.31 


25.33 


5.19 


25.30 


0.3449 


2 


22.84 


20.40 


7.91 


27.93 


28.21 


7.25 


28.06 


0.6541 
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Figure - 8 Test Images (1-12 from Left to Right and Top to Bottom) 
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Figure - 9 Normalized Compression Ratio Obtained for Test Images 

Increased Compression Ratio by Hybrid JPEG Compression 



0.7 - 














o 






ir o 4 

c 

o 

'55 

w 














E 03 

o 

o 
















- 


n , 












, i — i , . m , 



10 11 



Images 



Figure - 10 Increased Compression Ratios by Hybrid Compression. 

V. CONCLUSION the background area is lower in Hybrid JPEG method which 

^ . .■ r-TT i ■ i t™-^ .ii- is acceptable, since the background area is not vital. The 

The compression ratio or Hybrid JPEG method is TT . ./ ' ., A . .: ,, r . ... , 

i-i .i tt^t-^ .ii- .1 ™n/ r, A T Hybrid JPEG method is suitable lor imagery with larger 

higher than JPEG method in more than 90% ol test cases. In . .-ii i -, -, . ■ i , ri ■ • -i i 

^ ° t i .i tt i -1^1-^ i TT^i-^ .i i • trivial background and certain level ol loss is permissible, 

the worst case both Hybrid JPEG and JPEG method gives 

the same compression ratio. The PSNR value at the main 

subject area is same for both methods. The PSNR value at 
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